(2) INFORMATION FOR SEQ ID NO: 1:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 25 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear



(ii) MOLECULE TYPE: protein 




(iii) HYPOTHETICAL: NO



(iii) ANTI-SENSE: NO 

(v) FRAGMENT TYPE: internal 

(vi) ORIGINAL SOURCE:

(A) ORGANISM: Helianthus annuus

(B) STRAIN: cv. zebulon

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1:

Ser Ile Asn Val Asp Ile Glu Gln Glu Thr Ala Trp Val Gln Ala Gly
1 5 10 15

Ala Thr Leu Gly Glu Val Tyr Tyr Arg 
20 25 



(2) INFORMATION FOR SEQ ID NO: 2:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 25 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(iii) HYPOTHETICAL: NO 

(iii) ANTI-SENSE: NO 

(v) FRAGMENT TYPE: internal 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helianthus annuus 

(B) STRAIN: cv. zebulon 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

Asp Pro Ser Phe Pro Ile Thr Gly Glu Val Tyr Thr Pro Gly Xaa Ser
1 5 10 15

Ser Phe Pro Thr Val Leu Gln Asn Tyr
20 25 



(2) INFORMATION FOR SEQ ID NO: 3:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 33 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: YES 

(ix) FEATURE: 

(A) NAME/KEY: misc_feature 



(B) LOCATION: 1 

(D) OTHER INFORMATION: /function= "primer" 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3:
AACTTCTCCN AGNGTNGCNC CNGCTTGNAC CCA 



(2) INFORMATION FOR SEQ ID NO: 4:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 32 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: YES 

(ix) FEATURE: 

(A) NAME/KEY: misc_feature

(B) LOCATION: 1 

(D) OTHER INFORMATION: /function= "primer" 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 
GATCCNTCTT TCCCNATTAC TGGNGAGGTT TA 



(2) INFORMATION FOR SEQ ID NO: 5:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 354 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO
(iii) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helianthus annuus 

(B) STRAIN: cv. zebulon 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1..354 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5:

GAT CCG TCT TTC CCG ATT ACT GGG GAG GTT TAC ACT CCC GGA AAC
Asp Pro Ser Phe Pro Ile Thr Gly Glu Val Tyr Thr Pro Gly Asn
1 5 10 15

TCT TTT CCT ACC GTC TTG CAA AAC TAC ATC CGA AAC CTT CGG TTC AAT 96
Ser Phe Pro Thr Val Leu Gln Asn Tyr Ile Arg Asn Leu Arg Phe Asn
20 25 30

GAA ACT ACC ACA CCA AAA CCC TTT TTA ATC ATC ACA GCC GAA CAT GTT 144
Glu Thr Thr Thr Pro Lys Pro Phe Leu Ile Ile Thr Ala Glu His Val
35 40 45

TCC CAC ATT CAG GCA GCT GTG GTT TGT GGC AAA CAA AAC CGG TTG CTA 192
Ser His Ile Gln Ala Ala Val Val Cys Gly Lys Gln Asn Arg Leu Leu
50 55 60

CTG AAA ACC AGA AGC GGT GGT CAT GAT TAT GAA GGT CTT TCC TAC CTT 240
Leu Lys Thr Arg Ser Gly Gly His Asp Tyr Glu Gly Leu Ser Tyr Leu
65 70 75 80

ACA AAC ACA AAC CAA CCC TTC TTC ATT GTG GAC ATG TTC AAT TTA AGG 288
Thr Asn Thr Asn Gln Pro Phe Phe Ile Val Asp Met Phe Asn Leu Arg
85 90 95

TCC ATA AAC GTA GAT ATC GAA CAA GAA ACC GCA TGG GTC CAA GCC GGC 336
Ser Ile Asn Val Asp Ile Glu Gln Glu Thr Ala Trp Val Gln Ala Gly
100 105 110

GCC ACC CTC GGA GAA GTT 354 
Ala Thr Leu Gly Glu Val 
115 

(2) INFORMATION FOR SEQ ID NO: 6:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 118 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 

Asp Pro Ser Phe Pro Ile Thr Gly Glu Val Tyr Thr Pro Gly Asn Ser
1 5 10 15

Ser Phe Pro Thr Val Leu Gln Asn Tyr Ile Arg Asn Leu Arg Phe Asn
20 25 30

Glu Thr Thr Thr Pro Lys Pro Phe Leu Ile Ile Thr Ala Glu His Val
35 40 45

Ser His Ile Gln Ala Ala Val Val Cys Gly Lys Gln Asn Arg Leu Leu
50 55 60

Leu Lys Thr Arg Ser Gly Gly His Asp Tyr Glu Gly Leu Ser Tyr Leu
65 70 75 80

Thr Asn Thr Asn Gln Pro Phe Phe Ile Val Asp Met Phe Asn Leu Arg
85 90 95

Ser Ile Asn Val Asp Ile Glu Gln Glu Thr Ala Trp Val Gln Ala Gly
100 105 110




Ala Thr Leu Gly Glu Val 
115 

(2) INFORMATION FOR SEQ ID NO: 7:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 21 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: cDNA
(iii) HYPOTHETICAL: NO
(iii) ANTI-SENSE: NO

(ix) FEATURE:

(A) NAME/KEY: misc_feature

(B) LOCATION: 1

(D) OTHER INFORMATION: /function= "primer"
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 
CAGGCAGCTG TGGTTTGTGG C 



(2) INFORMATION FOR SEQ ID NO: 8:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO 

(ix) FEATURE: 

(A) NAME/KEY: misc_feature

(B) LOCATION: 1 

(D) OTHER INFORMATION: /function= "primer" 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 
GTCCACAATG AAGAAGGGTT G 



(2) INFORMATION FOR SEQ ID NO: 9:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 25 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: cDNA

(iii) HYPOTHETICAL: NO

(iii) ANTI-SENSE: NO

(ix) FEATURE:

(A) NAME/KEY: misc_feature

(B) LOCATION: 1

(D) OTHER INFORMATION: /function= "primer"
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9:
ACGTAGATAT CGAACAAGAA ACCGC 



(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10:
GCTTTACTAC ACGGGCTTCC CCAG 



(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 
CTGGGGAAGC CCGTGTAGTA AAGC 



(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: cDNA 



(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12:
GGTACTCCAA CCACGGCGCT C 



(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13:
CGGGAAGTTG CAGAAGATTG GGTTG 



(2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA
(iii) HYPOTHETICAL: NO
(iii) ANTI-SENSE: NO
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14:
GAGCAAGAGA AGAAGGAGAC 



(2) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1784 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA to mRNA 
(iii) HYPOTHETICAL: NO 



(iii) ANTI-SENSE: NO 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helianthus annuus 

(B) STRAIN: Zebulon 

(ix) FEATURE:

(A) NAME/KEY: CDS

(B) LOCATION: 21..1608

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 

ATATCACATC TTCTTTCAAC ATG CAA ACT TCC ATT CTT ACT CTC CTT CTT 50
Met Gln Thr Ser Ile Leu Thr Leu Leu Leu
1 5 10

CTC TTG CTC TCA ACC CAA TCT TCT GCA ACT TCC CGT TCC ATT ACA GAT 98
Leu Leu Leu Ser Thr Gln Ser Ser Ala Thr Ser Arg Ser Ile Thr Asp
15 20 25

CGC TTC ATT CAA TGT TTA CAC GAC CGG GCC GAC CCT TCA TTT CCG ATA 146
Arg Phe Ile Gln Cys Leu His Asp Arg Ala Asp Pro Ser Phe Pro Ile
30 35 40

ACC GGA GAG GTT TAC ACT CCC GGA AAC TCA TCT TTT CCT ACC GTC TTG 194
Thr Gly Glu Val Tyr Thr Pro Gly Asn Ser Ser Phe Pro Thr Val Leu
45 50 55

CAA AAC TAC ATC CGA AAC CTT CGG TTC AAT GAA ACT ACC ACA CCA AAA 242
Gln Asn Tyr Ile Arg Asn Leu Arg Phe Asn Glu Thr Thr Thr Pro Lys
60 65 70

CCC TTT TTA ATC ATC ACA GCC GAA CAT GTT TCC CAC ATT CAG GCA GCT 290
Pro Phe Leu Ile Ile Thr Ala Glu His Val Ser His Ile Gln Ala Ala
75 80 85 90

GTG GTT TGT GGC AAA CAA AAC CGG TTG CTA CTG AAA ACC AGA AGC GGT 338
Val Val Cys Gly Lys Gln Asn Arg Leu Leu Leu Lys Thr Arg Ser Gly
95 100 105

GGT CAT GAT TAT GAA GGT CTT TCC TAC CTT ACA AAC ACA AAC CAA CCC 386
Gly His Asp Tyr Glu Gly Leu Ser Tyr Leu Thr Asn Thr Asn Gln Pro
110 115 120

TTC TTC ATT GTG GAC ATG TTC AAT TTA AGG TCC ATA AAC GTA GAT ATC 434
Phe Phe Ile Val Asp Met Phe Asn Leu Arg Ser Ile Asn Val Asp Ile
125 130 135

GAA CAA GAA ACC GCA TGG GTC CAA GCC GGT GCG ACT CTT GGT GAA GTG 482
Glu Gln Glu Thr Ala Trp Val Gln Ala Gly Ala Thr Leu Gly Glu Val
140 145 150

TAC TAT CGA ATA GCG GAG AAA AGT AAC AAG CAT GGT TTT CCG GCA GGG 530
Tyr Tyr Arg Ile Ala Glu Lys Ser Asn Lys His Gly Phe Pro Ala Gly
155 160 165 170

GTT TGT CCA ACG GTT GGC GTT GGT GGG CAT TTT AGT GGT GGT GGG TAT 578
Val Cys Pro Thr Val Gly Val Gly Gly His Phe Ser Gly Gly Gly Tyr
175 180 185

GGT AAT TTG ATG AGA AAA TAT GGT TTG TCG GTT GAT AAT ATT GTT GAT 626
Gly Asn Leu Met Arg Lys Tyr Gly Leu Ser Val Asp Asn Ile Val Asp
190 195 200



GCT CAA ATA ATA GAT GTG AAT GGC AAG CTT TTG GAT CGA AAG AGT ATG 674
Ala Gln Ile Ile Asp Val Asn Gly Lys Leu Leu Asp Arg Lys Ser Met
205 210 215



GGT GAG GAT TTG TTT TGG GCG ATC ACC GGC GGT GGT GGT GTT AGT TTT 722
Gly Glu Asp Leu Phe Trp Ala Ile Thr Gly Gly Gly Gly Val Ser Phe
220 225 230

GGT GTG GTT CTA GCC TAC AAA ATC AAA CTA GTT CGT GTT CCG GAG GTT 770
Gly Val Val Leu Ala Tyr Lys Ile Lys Leu Val Arg Val Pro Glu Val
235 240 245 250

GTG ACC GTG TTT ACC ATT GAA AGA AGA GAG GAA CAA AAC CTC AGC ACC 818
Val Thr Val Phe Thr Ile Glu Arg Arg Glu Glu Gln Asn Leu Ser Thr
255 260 265

ATC GCG GAA CGA TGG GTA CAA GTT GCT GAT AAG CTA GAT AGA GAT CTT 866
Ile Ala Glu Arg Trp Val Gln Val Ala Asp Lys Leu Asp Arg Asp Leu
270 275 280

TTC CTT CGA ATG ACC TTT AGT GTC ATA AAC GAT ACC AAC GGT GGA AAG 914
Phe Leu Arg Met Thr Phe Ser Val Ile Asn Asp Thr Asn Gly Gly Lys
285 290 295

ACA GTC CGT GCT ATC TTT CCA ACG TTG TAC CTT GGA AAC TCG AGG AAT 962
Thr Val Arg Ala Ile Phe Pro Thr Leu Tyr Leu Gly Asn Ser Arg Asn
300 305 310

CTT GTT ACA CTT TTG AAT AAA GAT TTC CCC GAG TTA GGG TTG CAA GAA 1010
Leu Val Thr Leu Leu Asn Lys Asp Phe Pro Glu Leu Gly Leu Gln Glu
315 320 325 330

TCG GAT TGT ACT GAA ATG AGT TGG GTT GAG TCT GTG CTT TAC TAC ACG 1058
Ser Asp Cys Thr Glu Met Ser Trp Val Glu Ser Val Leu Tyr Tyr Thr
335 340 345

GGC TTC CCC AGT GGT ACT CCA ACC ACG GCG CTC TTA AGC CGT ACT CCT 1106
Gly Phe Pro Ser Gly Thr Pro Thr Thr Ala Leu Leu Ser Arg Thr Pro
350 355 360

CAA AGA CTC AAC CCA TTC AAG ATC AAA TCC GAT TAT GTG CAA AAT CCT 1154
Gln Arg Leu Asn Pro Phe Lys Ile Lys Ser Asp Tyr Val Gln Asn Pro
365 370 375

ATT TCT AAA CGA CAG TTC GAG TTC ATC TTC GAA AGG CTG AAA GAA CTT 1202
Ile Ser Lys Arg Gln Phe Glu Phe Ile Phe Glu Arg Leu Lys Glu Leu
380 385 390

GAA AAC CAA ATG TTG GCT TTC AAC CCA TAT GGT GGT AGA ATG AGT GAA 1250
Glu Asn Gln Met Leu Ala Phe Asn Pro Tyr Gly Gly Arg Met Ser Glu
395 400 405 410

ATA TCC GAA TTC GCA AAG CCT TTC CCA CAT AGA TCG GGT AAC ATA GCG 1298
Ile Ser Glu Phe Ala Lys Pro Phe Pro His Arg Ser Gly Asn Ile Ala
415 420 425

AAA ATT CAA TAC GAA GTA AAC TGG GAG GAT CTT AGC GAT GAA GCC GAA 1346
Lys Ile Gln Tyr Glu Val Asn Trp Glu Asp Leu Ser Asp Glu Ala Glu
430 435 440



AAT CGT TAC TTG AAT TTC ACA AGG CTG ATG TAT GAT TAC ATG ACC CCA 1394
Asn Arg Tyr Leu Asn Phe Thr Arg Leu Met Tyr Asp Tyr Met Thr Pro
445 450 455



TTT GTG TCG AAA AAC CCT AGA AAA GCA TTT TTG AAC TAT AGG GAT TTG 1442
Phe Val Ser Lys Asn Pro Arg Lys Ala Phe Leu Asn Tyr Arg Asp Leu
460 465 470

GAT ATT GGT ATC AAC AGC CAT GGC AGG AAT GCT TAT ACT GAA GGA ATG 1490
Asp Ile Gly Ile Asn Ser His Gly Arg Asn Ala Tyr Thr Glu Gly Met
475 480 485 490

GTT TAT GGG CAC AAG TAT TTC AAA GAG ACA AAT TAC AAG AGG CTA GTA 1538
Val Tyr Gly His Lys Tyr Phe Lys Glu Thr Asn Tyr Lys Arg Leu Val
495 500 505

AGT GTG AAG ACT AAA GTT GAT CCT GAC AAC TTC TTT AGG AAT GAG CAA 1586
Ser Val Lys Thr Lys Val Asp Pro Asp Asn Phe Phe Arg Asn Glu Gln
510 515 520

AGC ATC CCA ACT TTG TCA TCT T GAAGAACGTA CATATATAAA TAAATACCTT 1638
Ser Ile Pro Thr Leu Ser Ser
525

TGTGCATGGT ATTTTCAGGG TGTTAAAGTG ATATTCAGAT ATTTATGATA GAATTTTGAC 1698
TTGTATTTTA TACAATCAAA ATTGTATGGT TCTCCGAATT TCTCTTTTTA ATTCTGAAAA 1758
ATACATATTA GTATTGTCAA AAAAAA 1784

(2) INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 529 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 

Met Gln Thr Ser Ile Leu Thr Leu Leu Leu Leu Leu Leu Ser Thr Gln
1 5 10 15

Ser Ser Ala Thr Ser Arg Ser Ile Thr Asp Arg Phe Ile Gln Cys Leu
20 25 30

His Asp Arg Ala Asp Pro Ser Phe Pro Ile Thr Gly Glu Val Tyr Thr
35 40 45

Pro Gly Asn Ser Ser Phe Pro Thr Val Leu Gln Asn Tyr Ile Arg Asn
50 55 60

Leu Arg Phe Asn Glu Thr Thr Thr Pro Lys Pro Phe Leu Ile Ile Thr
65 70 75 80

Ala Glu His Val Ser His Ile Gln Ala Ala Val Val Cys Gly Lys Gln
85 90 95

Asn Arg Leu Leu Leu Lys Thr Arg Ser Gly Gly His Asp Tyr Glu Gly
100 105 110



Leu Ser Tyr Leu Thr Asn Thr Asn Gln Pro Phe Phe Ile Val Asp Met
115 120 125

Phe Asn Leu Arg Ser Ile Asn Val Asp Ile Glu Gln Glu Thr Ala Trp
130 135 140

Val Gln Ala Gly Ala Thr Leu Gly Glu Val Tyr Tyr Arg Ile Ala Glu
145 150 155 160

Lys Ser Asn Lys His Gly Phe Pro Ala Gly Val Cys Pro Thr Val Gly
165 170 175

Val Gly Gly His Phe Ser Gly Gly Gly Tyr Gly Asn Leu Met Arg Lys
180 185 190

Tyr Gly Leu Ser Val Asp Asn Ile Val Asp Ala Gln Ile Ile Asp Val
195 200 205

Asn Gly Lys Leu Leu Asp Arg Lys Ser Met Gly Glu Asp Leu Phe Trp
210 215 220

Ala Ile Thr Gly Gly Gly Gly Val Ser Phe Gly Val Val Leu Ala Tyr
225 230 235 240

Lys Ile Lys Leu Val Arg Val Pro Glu Val Val Thr Val Phe Thr Ile
245 250 255

Glu Arg Arg Glu Glu Gln Asn Leu Ser Thr Ile Ala Glu Arg Trp Val
260 265 270

Gln Val Ala Asp Lys Leu Asp Arg Asp Leu Phe Leu Arg Met Thr Phe
275 280 285

Ser Val Ile Asn Asp Thr Asn Gly Gly Lys Thr Val Arg Ala Ile Phe
290 295 300

Pro Thr Leu Tyr Leu Gly Asn Ser Arg Asn Leu Val Thr Leu Leu Asn
305 310 315 320

Lys Asp Phe Pro Glu Leu Gly Leu Gln Glu Ser Asp Cys Thr Glu Met
325 330 335

Ser Trp Val Glu Ser Val Leu Tyr Tyr Thr Gly Phe Pro Ser Gly Thr
340 345 350

Pro Thr Thr Ala Leu Leu Ser Arg Thr Pro Gln Arg Leu Asn Pro Phe
355 360 365

Lys Ile Lys Ser Asp Tyr Val Gln Asn Pro Ile Ser Lys Arg Gln Phe
370 375 380

Glu Phe Ile Phe Glu Arg Leu Lys Glu Leu Glu Asn Gln Met Leu Ala
385 390 395 400

Phe Asn Pro Tyr Gly Gly Arg Met Ser Glu Ile Ser Glu Phe Ala Lys
405 410 415

Pro Phe Pro His Arg Ser Gly Asn Ile Ala Lys Ile Gln Tyr Glu Val
420 425 430

Asn Trp Glu Asp Leu Ser Asp Glu Ala Glu Asn Arg Tyr Leu Asn Phe
435 440 445

Thr Arg Leu Met Tyr Asp Tyr Met Thr Pro Phe Val Ser Lys Asn Pro
450 455 460

Arg Lys Ala Phe Leu Asn Tyr Arg Asp Leu Asp Ile Gly Ile Asn Ser
465 470 475 480

His Gly Arg Asn Ala Tyr Thr Glu Gly Met Val Tyr Gly His Lys Tyr
485 490 495

Phe Lys Glu Thr Asn Tyr Lys Arg Leu Val Ser Val Lys Thr Lys Val
500 505 510

Asp Pro Asp Asn Phe Phe Arg Asn Glu Gln Ser Ile Pro Thr Leu Ser
515 520 525

Ser 



(2) INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: cDNA
(iii) HYPOTHETICAL: NO
(iii) ANTI-SENSE: NO

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17:
CCGCCATGGA GACTTCCATT CTTACTC 

(2) INFORMATION FOR SEQ ID NO: 18:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 
GCCGGATCCT CAAGATGACA AAGTTGGGAT GCT 



(2) INFORMATION FOR SEQ ID NO: 19: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1589 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA to mRNA 



(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helianthus annuus 

(B) STRAIN: Zebulon 

(ix) FEATURE: 

(A) NAME/KEY: CDS

(B) LOCATION: 1..1590 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 

ATG GAG ACT TCC ATT CTT ACT CTC CTT CTT CTC TTG CTC TCA ACC CAA 48
Met Glu Thr Ser Ile Leu Thr Leu Leu Leu Leu Leu Leu Ser Thr Gln
1 5 10 15

TCT TCT GCA ACT TCC CGT TCC ATT ACA GAT CGC TTC ATT CAA TGT TTA 96
Ser Ser Ala Thr Ser Arg Ser Ile Thr Asp Arg Phe Ile Gln Cys Leu
20 25 30

CAC GAC CGG GCC GAC CCT TCA TTT CCG ATA ACC GGA GAG GTT TAC ACT 144
His Asp Arg Ala Asp Pro Ser Phe Pro Ile Thr Gly Glu Val Tyr Thr
35 40 45

CCC GGA AAC TCA TCT TTT CCT ACC GTC TTG CAA AAC TAC ATC CGA AAC 192
Pro Gly Asn Ser Ser Phe Pro Thr Val Leu Gln Asn Tyr Ile Arg Asn
50 55 60

CTT CGG TTC AAT GAA ACT ACC ACA CCA AAA CCC TTT TTA ATC ATC ACA 240
Leu Arg Phe Asn Glu Thr Thr Thr Pro Lys Pro Phe Leu Ile Ile Thr
65 70 75 80

GCC GAA CAT GTT TCC CAC ATT CAG GCA GCT GTG GTT TGT GGC AAA CAA 288
Ala Glu His Val Ser His Ile Gln Ala Ala Val Val Cys Gly Lys Gln
85 90 95

AAC CGG TTG CTA CTG AAA ACC AGA AGC GGT GGT CAT GAT TAT GAA GGT 336
Asn Arg Leu Leu Leu Lys Thr Arg Ser Gly Gly His Asp Tyr Glu Gly
100 105 110

CTT TCC TAC CTT ACA AAC ACA AAC CAA CCC TTC TTC ATT GTG GAC ATG 384
Leu Ser Tyr Leu Thr Asn Thr Asn Gln Pro Phe Phe Ile Val Asp Met
115 120 125

TTC AAT TTA AGG TCC ATA AAC GTA GAT ATC GAA CAA GAA ACC GCA TGG 432
Phe Asn Leu Arg Ser Ile Asn Val Asp Ile Glu Gln Glu Thr Ala Trp
130 135 140



GTC CAA GCC GGT GCG ACT CTT GGT GAA GTG TAC TAT CGA ATA GCG GAG 480
Val Gln Ala Gly Ala Thr Leu Gly Glu Val Tyr Tyr Arg Ile Ala Glu
145 150 155 160

AAA AGT AAC AAG CAT GGT TTT CCG GCA GGG GTT TGT CCA ACG GTT GGC 528
Lys Ser Asn Lys His Gly Phe Pro Ala Gly Val Cys Pro Thr Val Gly
165 170 175

GTT GGT GGG CAT TTT AGT GGT GGT GGG TAT GGT AAT TTG ATG AGA AAA 576
Val Gly Gly His Phe Ser Gly Gly Gly Tyr Gly Asn Leu Met Arg Lys
180 185 190

TAT GGT TTG TCG GTT GAT AAT ATT GTT GAT GCT CAA ATA ATA GAT GTG 624
Tyr Gly Leu Ser Val Asp Asn Ile Val Asp Ala Gln Ile Ile Asp Val
195 200 205

AAT GGC AAG CTT TTG GAT CGA AAG AGT ATG GGT GAG GAT TTG TTT TGG 672
Asn Gly Lys Leu Leu Asp Arg Lys Ser Met Gly Glu Asp Leu Phe Trp
210 215 220

GCG ATC ACC GGC GGT GGT GGT GTT AGT TTT GGT GTG GTT CTA GCC TAC 720
Ala Ile Thr Gly Gly Gly Gly Val Ser Phe Gly Val Val Leu Ala Tyr
225 230 235 240

AAA ATC AAA CTA GTT CGT GTT CCG GAG GTT GTG ACC GTG TTT ACC ATT 768
Lys Ile Lys Leu Val Arg Val Pro Glu Val Val Thr Val Phe Thr Ile
245 250 255

GAA AGA AGA GAG GAA CAA AAC CTC AGC ACC ATC GCG GAA CGA TGG GTA 816
Glu Arg Arg Glu Glu Gln Asn Leu Ser Thr Ile Ala Glu Arg Trp Val
260 265 270

CAA GTT GCT GAT AAG CTA GAT AGA GAT CTT TTC CTT CGA ATG ACC TTT 864
Gln Val Ala Asp Lys Leu Asp Arg Asp Leu Phe Leu Arg Met Thr Phe
275 280 285

AGT GTC ATA AAC GAT ACC AAC GGT GGA AAG ACA GTC CGT GCT ATC TTT 912
Ser Val Ile Asn Asp Thr Asn Gly Gly Lys Thr Val Arg Ala Ile Phe
290 295 300

CCA ACG TTG TAC CTT GGA AAC TCG AGG AAT CTT GTT ACA CTT TTG AAT 960
Pro Thr Leu Tyr Leu Gly Asn Ser Arg Asn Leu Val Thr Leu Leu Asn
305 310 315 320

AAA GAT TTC CCC GAG TTA GGG TTG CAA GAA TCG GAT TGT ACT GAA ATG 1008
Lys Asp Phe Pro Glu Leu Gly Leu Gln Glu Ser Asp Cys Thr Glu Met
325 330 335

AGT TGG GTT GAG TCT GTG CTT TAC TAC ACG GGC TTC CCC AGT GGT ACT 1056
Ser Trp Val Glu Ser Val Leu Tyr Tyr Thr Gly Phe Pro Ser Gly Thr
340 345 350

CCA ACC ACG GCG CTC TTA AGC CGT ACT CCT CAA AGA CTC AAC CCA TTC 1104
Pro Thr Thr Ala Leu Leu Ser Arg Thr Pro Gln Arg Leu Asn Pro Phe
355 360 365

AAG ATC AAA TCC GAT TAT GTG CAA AAT CCT ATT TCT AAA CGA CAG TTC 1152
Lys Ile Lys Ser Asp Tyr Val Gln Asn Pro Ile Ser Lys Arg Gln Phe
370 375 380



GAG TTC ATC TTC GAA AGG ATG AAA GAA CTT GAA AAC CAA ATG TTG GCG 1200
Glu Phe Ile Phe Glu Arg Met Lys Glu Leu Glu Asn Gln Met Leu Ala
385 390 395 400



TTC AAC CCA TAT GGT GGT AGA ATG AGT GAA ATA TCC GAA TTC GCA AAG 1248
Phe Asn Pro Tyr Gly Gly Arg Met Ser Glu Ile Ser Glu Phe Ala Lys
405 410 415

CCT TTC CCA CAT AGA TCG GGT AAC ATA GCG AAG ATT CAA TAC GAA GTA 1296
Pro Phe Pro His Arg Ser Gly Asn Ile Ala Lys Ile Gln Tyr Glu Val
420 425 430

AAC TGG GAG GAT CTT AGC GAT GAA GCC GAA AAT CGT TAC TTG AAT TTC 1344
Asn Trp Glu Asp Leu Ser Asp Glu Ala Glu Asn Arg Tyr Leu Asn Phe
435 440 445

ACA AGG CTG ATG TAT GAT TAC ATG ACT CCA TTT GTG TCG AAA AAC CCT 1392
Thr Arg Leu Met Tyr Asp Tyr Met Thr Pro Phe Val Ser Lys Asn Pro
450 455 460

AGA GAA GCA TTT TTG AAC TAT AGG GAT TTG GAT ATT GGT ATC AAC AGC 1440
Arg Glu Ala Phe Leu Asn Tyr Arg Asp Leu Asp Ile Gly Ile Asn Ser
465 470 475 480

CAT GGC AGG AAT GCT TAT ACT GAA GGA ATG GTT TAT GGG CAC AAA TAT 1488
His Gly Arg Asn Ala Tyr Thr Glu Gly Met Val Tyr Gly His Lys Tyr
485 490 495

TTC AAA GAG ACA AAT TAC AAG AGG CTA GTA AGT GTG AAG ACT AAA GTT 1536
Phe Lys Glu Thr Asn Tyr Lys Arg Leu Val Ser Val Lys Thr Lys Val
500 505 510

GAT CCT GAC AAC TTC TTT AGG AAT GAG CAA AGC ATC CCA ACT TTG TCA 1584
Asp Pro Asp Asn Phe Phe Arg Asn Glu Gln Ser Ile Pro Thr Leu Ser
515 520 525

TCT TG 1589
Ser
530

(2) INFORMATION FOR SEQ ID NO: 20:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 529 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20: 

Met Glu Thr Ser Ile Leu Thr Leu Leu Leu Leu Leu Leu Ser Thr Gln
1 5 10 15

Ser Ser Ala Thr Ser Arg Ser Ile Thr Asp Arg Phe Ile Gln Cys Leu
20 25 30

His Asp Arg Ala Asp Pro Ser Phe Pro Ile Thr Gly Glu Val Tyr Thr
35 40 45

Pro Gly Asn Ser Ser Phe Pro Thr Val Leu Gln Asn Tyr Ile Arg Asn
50 55 60

Leu Arg Phe Asn Glu Thr Thr Thr Pro Lys Pro Phe Leu Ile Ile Thr
65 70 75 80

Ala Glu His Val Ser His Ile Gln Ala Ala Val Val Cys Gly Lys Gln
85 90 95

Asn Arg Leu Leu Leu Lys Thr Arg Ser Gly Gly His Asp Tyr Glu Gly
100 105 110

Leu Ser Tyr Leu Thr Asn Thr Asn Gln Pro Phe Phe Ile Val Asp Met
115 120 125

Phe Asn Leu Arg Ser Ile Asn Val Asp Ile Glu Gln Glu Thr Ala Trp
130 135 140

Val Gln Ala Gly Ala Thr Leu Gly Glu Val Tyr Tyr Arg Ile Ala Glu
145 150 155 160

Lys Ser Asn Lys His Gly Phe Pro Ala Gly Val Cys Pro Thr Val Gly
165 170 175

Val Gly Gly His Phe Ser Gly Gly Gly Tyr Gly Asn Leu Met Arg Lys
180 185 190

Tyr Gly Leu Ser Val Asp Asn Ile Val Asp Ala Gln Ile Ile Asp Val
195 200 205

Asn Gly Lys Leu Leu Asp Arg Lys Ser Met Gly Glu Asp Leu Phe Trp
210 215 220

Ala Ile Thr Gly Gly Gly Gly Val Ser Phe Gly Val Val Leu Ala Tyr
225 230 235 240

Lys Ile Lys Leu Val Arg Val Pro Glu Val Val Thr Val Phe Thr Ile
245 250 255

Glu Arg Arg Glu Glu Gln Asn Leu Ser Thr Ile Ala Glu Arg Trp Val
260 265 270

Gln Val Ala Asp Lys Leu Asp Arg Asp Leu Phe Leu Arg Met Thr Phe
275 280 285

Ser Val Ile Asn Asp Thr Asn Gly Gly Lys Thr Val Arg Ala Ile Phe
290 295 300

Pro Thr Leu Tyr Leu Gly Asn Ser Arg Asn Leu Val Thr Leu Leu Asn
305 310 315 320

Lys Asp Phe Pro Glu Leu Gly Leu Gln Glu Ser Asp Cys Thr Glu Met
325 330 335

Ser Trp Val Glu Ser Val Leu Tyr Tyr Thr Gly Phe Pro Ser Gly Thr
340 345 350

Pro Thr Thr Ala Leu Leu Ser Arg Thr Pro Gln Arg Leu Asn Pro Phe
355 360 365



Lys Ile Lys Ser Asp Tyr Val Gln Asn Pro Ile Ser Lys Arg Gln Phe
370 375 380

Glu Phe Ile Phe Glu Arg Met Lys Glu Leu Glu Asn Gln Met Leu Ala
385 390 395 400

Phe Asn Pro Tyr Gly Gly Arg Met Ser Glu Ile Ser Glu Phe Ala Lys
405 410 415

Pro Phe Pro His Arg Ser Gly Asn Ile Ala Lys Ile Gln Tyr Glu Val
420 425 430

Asn Trp Glu Asp Leu Ser Asp Glu Ala Glu Asn Arg Tyr Leu Asn Phe
435 440 445

Thr Arg Leu Met Tyr Asp Tyr Met Thr Pro Phe Val Ser Lys Asn Pro
450 455 460

Arg Glu Ala Phe Leu Asn Tyr Arg Asp Leu Asp Ile Gly Ile Asn Ser
465 470 475 480

His Gly Arg Asn Ala Tyr Thr Glu Gly Met Val Tyr Gly His Lys Tyr
485 490 495

Phe Lys Glu Thr Asn Tyr Lys Arg Leu Val Ser Val Lys Thr Lys Val
500 505 510

Asp Pro Asp Asn Phe Phe Arg Asn Glu Gln Ser Ile Pro Thr Leu Ser
515 520 525

Ser



(2) INFORMATION FOR SEQ ID NO: 21: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 350 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA to mRNA 



(iii) HYPOTHETICAL: NO 



(iii) ANTI-SENSE: NO

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Arabidopsis thaliana 

(B) STRAIN: ecotype Columbia 
(ix) FEATURE: 

(A) NAME/KEY: CDS

(B) LOCATION: 2..350



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21: 

GAGAAACTCG GAGACTTTCA CACAATGCCT AACCTCAAAC TCCGACCCCA AACATCCCAT 60
CTCCCCCGCT ATCTTCTTCT CCGGAAATGG CTCCTACTCC TCCGTATTAC AAGCCAACAT 120
CCGTAACCTC CGCTTCAACA CCACCTCAAC TCCGAAACCC TTCCTCATAA TCGCCGCAAC 180
ACATGAATCC CATGTGCAAG CCGCGATTAC TTGCGGGAAA CGCCACAACC TTCAGATGAA 240
AATCAGAAGT GGAGGCCACG ACTACGATGG CTTGTCATAC GTTACATACT CTGGCAAACC 300
GTTCTTCGTC CTCGACATGT TTAACCTCCG TTCGGTGGAT GTCGACGTGG 350



(2) INFORMATION FOR SEQ ID NO: 22: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 278 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA to mRNA 
(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Arabidopsis thaliana 

(B) STRAIN: ecotype Columbia 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 2..278

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22: 
GGCATGGATC TCCGCCGGAG CGACTCTCGG AGAGGTTTAT TATCGGATTT GGGAGAAAAG 60
CAGAGTCCAT GGATTCCCCG CCGGAGTTTG ACCGACGGTT GGTGTTGGTG GGCATTTAAG 120
CGGCGGTGGT TACGGTAACA TGGTGAGGAA GTTTGGATTA TCTGTGGATT ACGTTGAGGA 180
TGCCAAGATC GTCGATGTAA ACNGTCGGGT TTTAGATCGG AAAGCAATGG GTGAGGATCT 240
GTTCTGGGCG ATTACCGGTG GAGGAGGAGG TAGCGTAC 278

(2) INFORMATION FOR SEQ ID NO: 23:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 345 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: cDNA
(iii) HYPOTHETICAL: NO
(iii) ANTI-SENSE: NO

(vi) ORIGINAL SOURCE:

(A) ORGANISM: Arabidopsis thaliana

(B) STRAIN: ecotype Columbia

(ix) FEATURE:

(A) NAME/KEY: CDS

(B) LOCATION: 2..345

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 23: 
TGGACATATT AGCGGAGGAG GATTCGGTAC AATAATGAGG AAATACGGTT TAGCGTCTGA 60
TAACGTTGTG GACGCACGTT TGATGGATGT AAATGGGAAA ACTCTTGACC GGAAAACGAT 120
GGGAGAGGAT TTGTTTTGGG CGCTTAGAGG CGGTGGAGCT GCGAGTTTTG GCGTTGTCTT 180
GTCGTGGAAG GTTAAGCTTG CTAGGGTTCC TGAAAAGGTA ACTTGTTTCA TAAGTCAACA 240
TCCGATGGGA CCTAGCATGA ACAAGCTTGT TCATAGATGG CAATCCATAG GATCAAGANN 300
GCTAGACGAA GATTTATTCA TCAGAGTCAA TATTGACAAC AGTCT 345

(2) INFORMATION FOR SEQ ID NO: 24: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 695 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA to mRNA 
(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Arabidopsis thaliana 

(B) STRAIN: ecotype Columbia 

(ix) FEATURE: 

(A) NAME/KEY: CDS

(B) LOCATION: 1..695 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24: 
GTTCGTTAAA ACCTATCCTN NANGGGCNAA AGNATATCAA AGNTTGNTTA NGNAACCCAA 60
NATTTCTGAA CTGGCCNCCT TCGGTGGTAT ATGNCNAAAN CCCTTGAATC TGCGNANCCN 120
ATTCCGCATA GAAACGGAAC CCTCTTCAAG ATTCTCTATT TACNCGAACT GNCTAGANNG 180
AATGACAAGA CATCGAGTAG NAAAATCAAC TGGATCAAAG AGATATACAA TTACATGGCG 240
CCTTATGTCT CAAGCAATCC AAGACAAGCA TATGTGAACT ACAGAGATCT AGACTTCGGA 300
CAGAACAAGA ACAACGCAAA GGTTAACTTC ATTGAAGCTA AAATCTGGGG ACCTAAGTAC 360
TTCAAAGGCA ATTTTGACAG ATTGGTGAAG ATTAAAACCA AGGTTGATCC AGAGAACTTC 420
TTCAGGCACG AGCAGAGTAT CCCACCTATG CCCTACTAGA AGCTAGGTTC ATGAAACCAA 480
TAACATTATC AAAAATAAGR ATAAATGRTA ATTGTATACA ACATGATTCG KCTTTCTTTA 540
TTTCAGACAA TGTGGACACT ACTCTAAANT AAAAWGTCNA TTTACCTTAA AAAAAAAATA 600
ATCCCCNNTA ANANAAAANT GGGGGGGCCN TTTTTGGGGN TCCCGGTTTT NGGACGGGGN 660
GCTTTNGGGG GGCTTGGNNT TTTTTTNGGN GCCCC 695



(2) INFORMATION FOR SEQ ID NO: 25:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 495 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: cDNA to mRNA
(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Arabidopsis thaliana 

(B) STRAIN: ecotype Columbia 

(ix) FEATURE: 

(A) NAME/KEY: CDS

(B) LOCATION: 2..495

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 25: 
TCTGTTTTNA GGCAGAGCAG AGGAAGTTGT TGCTTTGCTT GGTAAGGAGT TTCCTGAATT 60
NAGTTTAAAG AAGGAGAACT GTTCGGAGAT GACTTGGTTT CAGTCAGCTT TATGGTGGGA 120
TAATCGTGTT AACCCTACTC ANATTGATCC WAAAGTGTTT CTCGATCGGA ATCTTGATAG 180
AGCGAATTTC GGAAAGAGGA AATCGGATTA CGTTGCGAGT AAGATTCCTA GAGATGGGAT 240
TAAGYCTTTT TCCAAGARGA TGMCTGACCT GGGGAAAAYC GGGCTTGTTT TTAAWCCGTA 300
TGGTGGGAAA ATGGCGGAGG TTACGGTTAA CGCGACGCCG TTTCCNCACC GAAGCAAGCT 360
TTTTAAGATT CAGTACTCGG TGACTTNGCA AGAAAACTCT NTCGAGATAG AGAAAGGGTT 420
TCTTGAATCA GGCTAACGTC CTTATAGGTT CATGACCGGG TTTTTNAGCA AGANCCCTGG 480
AATNCTTACT TNAAT 495



(2) INFORMATION FOR SEQ ID NO: 26:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 204 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: cDNA to mRNA 
(iii) HYPOTHETICAL: NO 



(iii) ANTI-SENSE: NO 



(vi) ORIGINAL SOURCE:

(A) ORGANISM: Arabidopsis thaliana 

(B) STRAIN: ecotype Columbia 

(ix) FEATURE: 

(A) NAME/KEY: CDS

(B) LOCATION: 1..204 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 26: 

AAATTAAAAC AAATCAATGT TGATATTGAA TCCAATAGTG CTTGGTTTCA ACCTGGTGCT 60

ACGCTTGGTG AGCTTTACTA CAGAATTNCA GAGAAGAGCA AAATCCATGG ATTTCCNGCG 120

GGTTTNTNCA CAAGCNTAGG CATAGGTGGG TATATNANAG GCGGTGGATA CGGTACCTTG 180

ATGAGGAAGT ATGGTCTTNC GGGA 204

(2) INFORMATION FOR SEQ ID NO: 27:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 491 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: cDNA to mRNA 
(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Arabidopsis thaliana 

(B) STRAIN: ecotype Columbia 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 2.. 491 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 27: 
GAGATTTCTC GAGCAAGATA CTCCACTGAT GATCTTTGAG CCATTGGGTG GGAAAATCAG 60
CAAGATTTCA GAAACAGAAT CTCCATATCC ACACAGAAGA GGTAATCTGT ATAATATACA 120
GTACATGGTG AAATGGAAAG TGAATGANGT CGAGGAGATG AACAAACATG TCAGGTGGAT 180
GAGATCGTTA CACGATTACA TGACTCCGTA TGTTTCTAAA TCGCCGAGAG GAGCTTATTT 240
GANTTACAGA GATCTTGATT TGGGCTCGAC CAAAGGGATT AACACGGGTT TCGGAGATGC 300
AAGGAAATGG NNGGGTGAGN CTTTTTTCAA AGGTAATTTC CAAGGGGTTA GGTTTTGGTT 360
AAAGGGGAGG TTTNNCCCAN CAAATTTTTT TTCAGGANCC GGCCANGNTT TTCCCCCCCC 420
TNTTTTTNGG NCCCCAATCN AAANCCCCGT TTTAAAAGGG GGGCCATTTC NTTTTTTNCA 480
NNTTAAAAGG G 491



(2) INFORMATION FOR SEQ ID NO: 28:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 407 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: cDNA to mRNA 
(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Arabidopsis thaliana 

(B) STRAIN: ecotype Columbia 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 3..407

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 28:

ATTTGTTCGT GAGGTTAACT TTGACTTTAG TCAACGGTAC GAAGCCTGGT GAGAATACGG 60

TTTTAGCGAC TTTCATTGGG ATGTATTTAG GCCGGTCGGA TAAGCTGTTG ACCGTNATGA 120

ACCGGGATTT CCCGGAGTTG AAGCTGAAGA AAACCGATTN TACCGAGATG AGATGGATCG 180

ATTCGGTTCT GTTTTGGGAC GATTATCCGG TTGGTACACC GACTTCTGTG CTACTAAATC 240

CGCTAGTCGC AAAAAAGTTG TTCATGAAAC GAAAATCGGA CTACGTGAAG CGTCTNATTT 300

TCGAGAACCC GATCTCNNGT TTGATACTCA AGAAATTTGT AGAGGTTNNG AAAGTTAAAA 360

TNAATTTGGA TCCGCATTNN GGNANNNATG GTGAAACCCC NNGTTNT 407

(2) INFORMATION FOR SEQ ID NO: 29:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 360 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA to mRNA 
(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Arabidopsis thaliana 

(B) STRAIN: ecotype Columbia 



(ix) FEATURE:

(A) NAME/KEY: CDS

(B) LOCATION: 3..360



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 29: 

ACGGCGTCGT ATTGGCCTAC AAAATAAACC TTGTTGAAGT CCCAGAAAAC GTCACCGTTT 60

TCAGAATCTC CCGGACGTTA GAACAAAATG CGACGGATAT CATTCACCGG TGGCAACAAG 120

TTGCACCGAA GCTTCCCGAC GAGCTTTTCA TAAGANCAGT CATTGACGTA NAAACGGCAC 180

TGTTTCATNN CTCAAAAGAC CGTCAGACAA CATTCATAGC AATGTTTCTA GGAGACACGN 240

CAACTCTACT GTCGATATTA AACCGGAGAT TCCCAGAATT GGGTTTGGTC CGGTCTGACT 300

GTACCGNAAC AAGCNNTTGG ATCCAATCTG TGCTATTTTT GGGACAAATA TCCCAGGTTG 360

(2) INFORMATION FOR SEQ ID NO: 30:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 427 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA to mRNA 
(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Arabidopsis thaliana 

(B) STRAIN: ecotype Columbia 

(ix) FEATURE: 

(A) NAME/KEY: CDS

(B) LOCATION: 3..427

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 30: 
TCTTCACTGT CACCAAAACG TTAGAACAAG ACGCAAGATT GAAGACTATT TCTAAGTGGC 60 
AACAAATTTC ATCCAAGATT ATTGAAGAGA TACACATCCG AGTGGTACTC AGAGCAGCTG 120
GAAATGATGG AAACAAGACT GTGACAATGA CCTACCTAGG TCAGTTTCTT GGCGAGAAAG 180 
GCACCTTGCT GAAGGTTATG GAGAAGGCTT TTCCAGAACT AGGGTTAACT CAAAAGGATT 240 
GTACTGAAAT GAGCTGGATT GAAGCCGCCC TTTTCCATGG TGGRTTTCCA ACAGGKTCTC 300 
CTATTGAAAT TTTGCTTMAG CTCAAGTCGC CTYTAGGAAA AGRTTWCTTC AAAGCAACGK 360
CGGATTTCGT TAAAGAACCT WTTCCTGTGA TAGGGCTCAA AGGAATATTC AAAAGATTGA 420
TTGAAGG 427

(2) INFORMATION FOR SEQ ID NO: 31:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 437 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA to mRNA 

(iii) HYPOTHETICAL: NO 

(iii) ANTI-SENSE: NO
(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Arabidopsis thaliana 

(B) STRAIN: ecotype Columbia 

(ix) FEATURE: 

(A) NAME/KEY: CDS

(B) LOCATION: 1..437 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 31: 
GTTGTACTAT CATNGAAGAT TAAGTTAGTC GATGTTCCGT CCACGGTCAC CGNGTTTAAA 60 
GTCCAGAAAC ATNAGGAGAA AGAGGCCGTT AGGNTCATCA ACAAGTGGCA GTATGTTGCG 120
GATAAGGTCC CTGAAGATCT TTTCATCAGC GCAACGTTGG NGAGATCAAA CGGAAACTCT 180 
GTGCAGGCTT TGTTTACTGG ACTCTATCTT GGNCCGGTGA ATAATNTCTT GGCCTTGATG 240 
GAAGAAAAGT TTCCAGANTT AGGTCTTGAT ATCCAAGNCT GCACAGAGAT GAGTTGGGCT 300
GAATCTGCAC TCTGGTNTNC TGNTTTCNCT AAAGGAGAGN CTCCTTGGGT GTTCCNCGCG 360
GATCGGNAGC GGNCAATTTN TGGNCTTTCA AGGGGAAAGN CGGCTTTTTN CAAGAACCCG 420
NTACCCGGGG TTCAATT 437

(2) INFORMATION FOR SEQ ID NO: 32: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 441 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: cDNA to mRNA

(iii) HYPOTHETICAL: NO

(iii) ANTI-SENSE: NO

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Arabidopsis thaliana 

(ix) FEATURE: 

(A) NAME/KEY: CDS

(B) LOCATION: 1..441 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 32: 
GCGGACCCTA TAGATCANNA TGTGCTACTG ANAGAAGAGG AAGCCAAGAA CAAGCCGGAG 60
ACAGATAAAT ATCTGAAATG GGNCGATANC GTTTACGAAT TTATGACNCC ATATGTTTCG 120
AAATCTCCAA GAGGAGCTTA TGTCAATTTC AAGGATATGG ATTTGGGTAT GTATCTTGGA 180
AAGAAGAAGA CAAAGTACGA GGAAGGAAAG AGTTGGGGAG TGAAGTATTT CAAGAACAAT 240
TTCGAGAGAT TGGTGAGAGT GAAGACTAGG GTTGATCCAA CAGATTTCTT CTGCGATGAA 300
CAGAGCATTC CTCTGGTGAA CAAAGTTACC TGAAGATATC ATTTGAAGTT TTTTATTAGT 360
CCCTTTTCTC TGTGAAATCA TCTGTGCGTG TTGAATATTA TGCGTCAAGT GTGTAACTTA 420
TGTGTGTGAT TGTGAATTGT G 441

(2) INFORMATION FOR SEQ ID NO: 33: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 502 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA to mRNA 
(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Arabidopsis thaliana 

(B) STRAIN: ecotype Columbia 

(ix) FEATURE: 

(A) NAME/KEY: CDS

(B) LOCATION: 2..502

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 33: 
CTGGCTTAAC ACAACGTCGT TTTGGGCCAA TTACCCGGCG GGTACACCCA AGAGCATCCT 60
TCTAGATAGG CCTCCGACGA ATTCAGTGTC ATTTAAGAGT AAATCGGATT TTGTCAAAAA 120 
ACCAATACCC AAAAAAGGTT TAGAGAAGCT TTGGAAGACA ATGTTTAAAT TCAACAGTAG 180 
CGTCTCGTTG CAATTCAACC CTTACGGTGG AGTGATGGAC CGGATTCCGG CAACGGCCAC 240 
CGCTTTTCCT CATCGGAAAG GAAACTTGTT CAAGGTTCAA TACNCTACGA TGTGGTTTGA 300 
CGCAAACGCC ACACAGAGTA GCCNGGCTAT GATGAATGAG CTTTTTGAGG TGGCGGGACC 360 
GTACGTGNGT CAAGTAAACC CGAGANANGG CTTCCTTTAA NTTCAGAGNC CATCGNTNTT 420
NGGAGCAANN CCAAGTGGGG GGGNCCAACC GGGGGNTNAA ANCNNAGNTC TTNGGGGGCC 480
CAGAATTTCC TTNGGGGAAT TT 502

(2) INFORMATION FOR SEQ ID NO: 34: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 400 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear



(ii) MOLECULE TYPE: cDNA to mRNA 
(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Arabidopsis thaliana 

(B) STRAIN: ecotype Columbia 

(ix) FEATURE: 

(A) NAME/KEY: CDS

(B) LOCATION: 2..400

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 34: 
NGGGAATTGC NCGAGGNAAG TTGTACCCAA TTCCTGGACC ACCATTGGTT TCCCAAGAAN 60
CCCGAGACAA CCGTTTTTCA ATNACCGTGA TGTTGATTTG GGTATTAATT CTCATAATGG 120
TAAAATCAGT AGTTATGTGG AAGGTAAACG TTACGGGAAG AAGTATTTCG CAGGTAATTT 180
CGAGAGATTG GTGAAGATTA AGACGAGAGT TGATAGTGGT AATTTCTTTA GGAACGAACA 240
GAGTATTCCT GTGTTACCAT AAGTGTATTT ATTTGATTAT TGGTTAGTGA AATTTGTTGT 300
TGTATAATGA TTATATGTCG TATTTTTATT TATTATTAGT AATTTATAAA GTTTGATATT 360
AAATACAAAT AGTATAATAA GATAGTTTCT TTTAGTAAAA 400 



(2) INFORMATION FOR SEQ ID NO: 35:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 383 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: cDNA to mRNA 
(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Arabidopsis thaliana 

(B) STRAIN: ecotype Columbia 

(ix) FEATURE: 

(A) NAME/KEY: CDS

(B) LOCATION: 2..383

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 35: 
CAACTCTAAT GGGAACACCT ACTTCGATCG AATGTCGATG GGGGAAGAGC TTTTCTGGGC 60
GGTTCGAGGA GGTGGAGCCG CGAGTTTCGG CATCGTGATG GGATACAAAA TCCGGTTGGT 120
TCCGGTTCCG GAGAAAGTTA CGGTTTTTAG CGTCGGAAAA ACCGTCGGAG AAGGAGCCGT 180
TGATCTTATA ATGAAGTGGC AGAACTTCTC TCATAGTACG GNTCGGAATT TNTTTGTGAA 240
GCTGANTTTT GANTTTAGTC AACGGTGCAA AGCCGGGTGA AAAAAAGGTT TTAGNGNCTT 300
TCANTTTGGN TGNAANCTTG GGGGTTTTAT NAGAACGGTT AACCGGGATT NANCCCGNGT 360
TTTCCCGGGG TTAAAACCTT NGG 383



(2) INFORMATION FOR SEQ ID NO: 36: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 354 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: cDNA to mRNA 
(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Arabidopsis thaliana 

(B) STRAIN: ecotype Columbia 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1..354 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 36: 
ATCAATGTTC TTACTAAACG TACACGAGCA TCGTTGGCTT TCAAGGCTAA ATCTGATTTT 60
NTTCAAGAAC CGATNCCTAA AACCGCGATT TCGAAGCTTT GGAGACGGTT GCAAGAACCG 120
GAAGCAGAGC ATGCTCAGCT AATTTNCACN CCATTTGGTG GTAAAATGAG TNAGATTGCA 180
GATTACGAAA CACCATTTCC GCATAGGAAG GGGAATATAT ATNAGATTCA GTACTTGAAT 240
TACTGGAGAG GAGACGTGAA AGAGAAGTAT ATTGAGATNG GTGGAGGAGA GTTTACGGTT 300
GNTATNAGTA AGTTTTTTGG CGAAGTNTNC CNAGAGGNGN CTTNNTNTAA ACCT 354



(2) INFORMATION FOR SEQ ID NO: 37:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 403 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: cDNA to mRNA 
(iii) HYPOTHETICAL: NO 



(iii) ANTI-SENSE: NO

(vi) ORIGINAL SOURCE:

(A) ORGANISM: Arabidopsis thaliana

(B) STRAIN: ecotype Columbia 

(ix) FEATURE: 

(A) NAME/KEY: CDS

(B) LOCATION: 2..403

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 37: 
TTTTTTAGTA CACTAATAAT CAAATGGAAT GAGAAATGAA GCCACAAAAG TATCTGCAAT 60 
CAAAATATCC TGCTATCTCC ATCTCAAGCT CTCAATAGTA TCCTCTCCGA AAGTGAAATC 120
AACATTTCAA ACTCTATTTC TTGGTGGAAT CGATAGACTG ATTCCTCTGA TGAACCAGAA 180 
GTTTCCGGAA CTCGGCTTAC GATCTCAAGA CTGTTCGGAA ATGAGCTGGA TCGAATCGAT 240 
AATGTTCTTC AACTGGAGAT CAGGACAGCC GTTAGAGATT TTGCTCAACA GAGACCTAAG 300 
GATTCGAGGA TCAGTATTTC AAAGCAAAGT CAGGATTATG GTTCAAAAAC CCGTTCCTGA 360 
AAACGTTTTT CGAAGAGGTA TCCAAGGGGT TTCTCGAGCA AGT 403

(2) INFORMATION FOR SEQ ID NO: 38: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 260 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA to mRNA 
(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Arabidopsis thaliana 

(B) STRAIN: ecotype Columbia 
(ix) FEATURE:

(A) NAME/KEY: CDS 

(B) LOCATION: 1..260 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 38: 
GAGATGAGTT GGATTAANTC TGTACTCTGG TTTGCTGATT TCCCTAAAGG AGAATCTCTT 60 
NGTGTTCTCA CGAATCGTAA GCGTACATCT CTATCTTTNA AAGGCAAAGA TGATTTTATC 120
CAAGAACCGA TACCCGAGGC TGCAATTNAA GAGATATGGA GGCGATTAGA AGCCCCCNAG 180
GCTCGGCTTG GAAAGATCAT ATTAACTCCA TTTGGTGGGA AAATNAGTGA AATGGCAGAG 240
TACGTANCAC CATTCCCACA 260



(2) INFORMATION FOR SEQ ID NO: 39: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 605 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA to mRNA 
(iii) HYPOTHETICAL: NO
(iii) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Arabidopsis thaliana 

(B) STRAIN: ecotype Columbia 

(ix) FEATURE: 

(A) NAME/KEY: CDS

(B) LOCATION: 2..605

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 39:
CTCTTGCATA TTCGCTGCAA GGATGGGAAA TTCAAAACCA CTCCCTACAA TTTTTTGTAT 60
TATAGTTTCA GTCTTGTATT TTTAATTCTA TTGCATAACA CCAACTTCTT CATCAGCCTC 120
CATCCAAGAT CAATTCATAA ACTGTGTCAA AAGAAACACA CATGTTTCTT TTCCACTCGA 180
GAAAACGTTA TTCACCCCTG CGAAAAACGT CTCTTTGTTC AACCAAGTCC TTGANTCGAC 240
GGCTCAAAAT CTCCAGTTCT TGGCAAAATC CATGCCTAAA CCGGGRTTCA TATTCAGACC 300
GATTCACCAG TCTCAAGTCC AAGSTTCCAT CATTTGTTCA AMGRAACTCG GGNTTCATTT 360
TNGTGTTTGA NGTGGCGGTC ACGATTTTCG AGGCCTTTGT NTTTATGTTT CACGGTTTGA 420
AAAAACCGTT TATATTACTC GGCCTGTCAA ANTTGNANNC AAAATCANAT GTTGGATATT 480
GNATTCCAAA TAGGTNCTTG GGGTNAACCT GGTGGCTANC GTTTGGTGAG CTTTTACTTT 540 
CAAGAATTTG CANGNGGANG TGCAAAGATT CCATGGGATT TCCCGGGGGG TTTNTTGCAC 600 
AATGT 605 



(2) INFORMATION FOR SEQ ID NO: 40: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 464 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA to mRNA 

(iii) HYPOTHETICAL: NO 

(iii) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Arabidopsis thaliana 



(B) STRAIN: ecotype Columbia 

(ix) FEATURE: 

(A) NAME/KEY: CDS

(B) LOCATION: 2..464

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 40: 
AACACAAAAC TCTTCCATTT GGCTTCTCTC TTGCATATTC GTTGCAAGGA TGGGAAATTC 60
AAAACCACTC CCTACAATTN CTTGTATTAT CGTTTCAGTC TTGTATTTTN NATTCTATTG 120
CATAACACCA ACTTCTTCAT CAGCCTCCAT CCAAGNTCAA TTCATAAACT GTGTCAAAAG 180
GAACACACAT GTTTCTTTTC CACTCGAGNA AACGGTATTC ACTCCTGCGG AAAACGGCTC 240
TNTTATTCAA CGGGTCCNTG AATCGACGGG TCAAAATCTC CAGTTCTTGG NAAAATCCAT 300
GNCTAAACCG GGGTTCATAT TCAGGCCGGT TCACCAGTCT CAAGTCCAAG NTTCCATCAT 360
TTGTTCAAAG GAACTCGGGA TTCATTTCCG CGNTAGAAGT GGCGGGCANN GGTTTCGGGG 420
CCTGTCTNTT GNTTANGGGN AGGAAAACCG GTTNTATTNC TCGG 464

(2) INFORMATION FOR SEQ ID NO: 41:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 386 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: cDNA to mRNA 
(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Arabidopsis thaliana 

(B) STRAIN: ecotype Columbia 

(ix) FEATURE: 

(A) NAME/KEY: CDS

(B) LOCATION: 1..386 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 41: 
TCGGGAGCCC ANGNTAAATT ANNTGAAAAT GGGGNCGNAT ANCCGTTTAC NGAATTTTAT 60
GACNCCCAAT ATGTTTCGAA ATCTCAAAGA NNGGGANCTT ATGTCAATTT CAAGGATATG 120
GATTTGGGTA TGTATCTTGG AAAGNAGAAG ACAAAGTACG AGGAAGGAAA GAGTTGGGGA 180
GTGAAGTATT TCAAGAACAA TTTCGAGAGA TTGGTGAGAG TGAAGACTAG GGTTGATCCN 240
ACAGATTTCN TCTGCGATGA ACAGAGCATT CCTCTGGTGN ACAAAGTTAC CTGAAGATAT 300
CATTTGAAGT TTTTTATTAG TCCCTTTTCT CTGTGAAATC ATCTGTGCGT GTTGAATANT 360
ATGCGTCAAG TGTGTAACTT ATGTGT 386



(2) INFORMATION FOR SEQ ID NO: 42: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 377 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA to mRNA 
(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Arabidopsis thaliana 

(B) STRAIN: ecotype Columbia 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1..377 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 42: 

TACCATAGGG AGGTGGTGNA AGATTTTGTA TGTAGNCTTA GGGGAAGGCG AGTAGTATGG 60

TGGTGGTGGG GAGCTGTAAA CGTATGGTGG TGGTGGAGAT TTGTATGTGG GCTGGTTAAC 120

TTCATTGAAG CTAAAATCTG GGGACCTAAG TACTTCAAAG GCAATTTTGA CAGATTGGTG 180

AAGATTAAAA CCAAGGTTGA TCCAGAGAAC TTCTTCAGGC ACGAGCAGAG TATCCCACCT 240

ATGCCCTACT AGAAGCTAGG TTCATGAAAC CAATAACATT ATCAAAAATA AGAATAAATG 300

ATAATTGTAT ACAACATGAT TCGTCTTTCT TTATTTCAGA CAATGTGGAC ACTACTCTAA 360

ATAAAATGTC ATTTACC 377



(2) INFORMATION FOR SEQ ID NO: 43:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 377 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA to mRNA 
(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Arabidopsis thaliana 

(B) STRAIN: ecotype Columbia 



(ix) FEATURE:

(A) NAME/KEY: CDS

(B) LOCATION: 1..377 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 43: 
TACCATAGGG AGGTGGTGNA AGATTTTGTA TGTAGNCTTA GGGGAAGGCG AGTAGTATGG 60
TGGTGGTGGG GAGCTGTAAA CGTATGGTGG TGGTGGAGAT TTGTATGTGG GCTGGTTAAC 120
TTCATTGAAG CTAAAATCTG GGGACCTAAG TACTTCAAAG GCAATTTTGA CAGATTGGTG 180
AAGATTAAAA CCAAGGTTGA TCCAGAGAAC TTCTTCAGGC ACGAGCAGAG TATCCCACCT 240 
ATGCCCTACT AGAAGCTAGG TTCATGAAAC CAATAACATT ATCAAAAATA AGAATAAATG 300 
ATAATTGTAT ACAACATGAT TCGTCTTTCT TTATTTCAGA CAATGTGGAC ACTACTCTAA 360
ATAAAATGTC ATTTACC 377

(2) INFORMATION FOR SEQ ID NO: 44:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 346 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: cDNA to mRNA 
(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Arabidopsis thaliana 

(B) STRAIN: ecotype Columbia 

(ix) FEATURE: 

(A) NAME/KEY: CDS

(B) LOCATION: 2..346

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 44: 

GAGCTGTGGA TATGGTCACA AATGGCAATC GGTTGGTCCG AAAACTGATC CGAATCTTTT 60 

TATGAGAATN TTGATTCAAC CAGTGACGAG GAAGAAGGTA AAGACTGTGA GAGCTTCTNT 120

GGTTGCCCTN TTTTNAGGCN AGACAGATGA AGTTTTTGCT TTCCTTAGTA AGGAGTTTCC 180

TGAATTGGGT TTAAAGAAGG AGAATTNTTC GGAGATGACT TGGTTTCANT CTGCTTTATG 240

GTGGGACAAT CGTCTTAATG CTACTCAGGT TGATCCTAAA GTNTTTCTTG ATCGGAATCT 300

CGATACCTCG AGTTTCGGTA AGAGGAAATC GGATTACGTC GCGACT 346 

(2) INFORMATION FOR SEQ ID NO: 45:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 261 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear



(ii) MOLECULE TYPE: cDNA to mRNA 
(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Arabidopsis thaliana 

(B) STRAIN: ecotype Columbia 

(ix) FEATURE: 

(A) NAME/KEY: CDS

(B) LOCATION: 2..261

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 45: 
ATGGGGTGAG ACTTATTTCA AAGGTAATTT CAAGAGATTA GGTTTGGTTA AAGGGAAGNT 60 
TGATCCAACA AATTTCTTCA GGAACGAACA GAGTATTCCT CCTCTGTTTT GAGTCCTCAA 120
TACAAAACCA GATATAAAAG ATGTCATTTC ATTTTTTCAA TTATAATAGA TAATGTAACT 180
TTCTGCTACA ATTGTAAAAG TGAGATGTAC CCAATACGGT TTAAGCGGAC CGAGAATAGT 240
CAATTCAAAG ACCAAATTCT G 261



(2) INFORMATION FOR SEQ ID NO: 46: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 478 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA to mRNA 
(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Arabidopsis thaliana 

(B) STRAIN: ecotype Columbia 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1..478 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 46: 
GCTCAAAGGA CTAACCATGA AAACTTCCTC AAGTGTCTCT CTCACCGANT CAACGAGGAC 60
GACTCAAGAN TTATACACAC ATCAAAAGAT CCTTCGTATT TNTCAATCTT GATTTCTTCC 120
ATACAAAATC CAAGTTTCTC TGTTCTCGAA ACACCTAAAC CGGTTTCAAT CATCACTCCG 180
GTTCAAGCCA CCGATGTTCA ATCTACGNTT AAATNCGCAC GGNCTTCACG GGTATACACA 240
ATCAGGGCTA GGAGTGGTNG TCATGACTAC GGAGGTTTAT CTTTACATTG GCTTAAAAAN 300
CANNCCGTTC GTTNNTCATT GATTTNNAGA AATCTTCCGG GCTTATTTAA CATNTAAGAT 360
GTTTGATAAN CCGGNNCCNG TTTGGGGTTC AAATCCCGGT GGCTTACAAA NTTNGGGGGA 420
ATTGTNCCTA TGAGGTTTGG AAAATTAANG CAAAATNTTT TGGGCCTTCC CGGCCGGT 478

(2) INFORMATION FOR SEQ ID NO: 47: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 579 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA to mRNA 
(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Arabidopsis thaliana 

(B) STRAIN: ecotype Columbia 

(ix) FEATURE: 

(A) NAME/KEY: CDS

(B) LOCATION: 2..579

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 47: 
GGCCGTTAGG ATCATCAAGA AATGGCAATA TGCTGCAGAT AAGGTTCCTG ATGATCTTTT 60
CATTAGGACA ACATTGGAGA GATCAAACAA GAACGCAGTA CACGCTTTGT TCACTGGACT 120
ATATATTGGT CCGGTGAACA ATCTATTGGC GTTGATGGAA GAAAAGTTTC CGGAACTAGG 180
TCTTGAGAAA GAAGGTTGTG AAGAGATGAG TTGGATTGAG TCTGTACTCT GGTTTGCTGA 240
TTTCCCTAAA GGAGAATCTC TTGGTGTTCT CACGAATCGT GAGCGTACAT CTCTATCTTT 300
CAAAGGCAAA GATGATTTTG TCCAAGAACC GATACCCGAG GCTGCAATTC AAGAGATATG 360
GAGGCGATTA GAAGCCCCCG AGGCTCGGCT TGGAAAGATC ATATTAACTC CATTTGGGTG 420
NGGNAAAATG AGTGAAATGG CAGAGNCCGA ACCACCAATT CCCACANNCG AGGGAGGGGA 480
ACCCCTNTGN GGNTCAGAAT GTGGTTCCTG GNNNNNAAGN GGGNGCCAGN ACCAANCCGG 540
GNCNGTAAAN CNTGNAATGG GCCNAACCCG TNCCGGATT 579

(2) INFORMATION FOR SEQ ID NO: 48: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 252 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: cDNA to mRNA 
(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Oryza sativa 

(B) STRAIN: Nipponbare, subsp. japonica

(D) DEVELOPMENTAL STAGE: etiolated shoot (8 days old) 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 3..252

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 48: 

TGTCCTGGAA GGTCCGCCTC GTGCAGGTTN CGACGACGGT GACGGTGTTC GTCGTCGGGA 60 

GGAACGTCGA CCAGGGCGCC GCNGACGTCG TCGCCAGATG GCAAGACGTC GCGCCGAGCC 120

TCCCTCCCGA GCTCACCATA CGGGTGATCG TNCGAGGGCA GCGCGCCACG TTCCAGTCGC 180

TGTACCTCGG CTCGTGCGCC GACCTGGTGC CGACGATGAG CAGCATGTTC CCGGAGCTCG 240

GGATGACGAT TG 252 



(2) INFORMATION FOR SEQ ID NO: 49:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 21 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(iii) HYPOTHETICAL: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Lactuca sativa 

(B) STRAIN: lollo bionda 

(ix) FEATURE: 

(A) NAME/KEY: Modified-site

(B) LOCATION: 12

(D) OTHER INFORMATION: /label= Ambiguous
/note= "Xaa = Cys or Ser"

(ix) FEATURE:

(A) NAME/KEY: Modified-site

(B) LOCATION: 20..21

(D) OTHER INFORMATION: /label= ambiguous
/note= "Xaa-Xaa probably is Ser-Phe"



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 49: 



Thr Ser Thr Ser Ile Ile Asp Arg Phe Thr Gln Xaa Leu Asn Asn Arg
1 5 10 15

Ala Asp Pro Xaa Xaa
20



(2) INFORMATION FOR SEQ ID NO: 50:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(iii) HYPOTHETICAL: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Lactuca sativa 

(B) STRAIN: lollo bionda 

(ix) FEATURE: 

(A) NAME/KEY: Modified-site

(B) LOCATION: 1 

(D) OTHER INFORMATION: /label= ambiguous 
/note= "Xaa = probably Ser" 

(ix) FEATURE: 

(A) NAME/KEY: Modified-site

(B) LOCATION: 3

(D) OTHER INFORMATION: /label= unknown

(ix) FEATURE:

(A) NAME/KEY: Modified-site

(B) LOCATION: 5 

(D) OTHER INFORMATION: /label= ambiguous 
/note= "Xaa = probably Ser" 

(ix) FEATURE: 

(A) NAME/KEY: Modified-site

(B) LOCATION: 12 

(D) OTHER INFORMATION: /label= ambiguous 
/note= "Xaa = probably Trp" 

(ix) FEATURE: 

(A) NAME/KEY: Modified-site

(B) LOCATION: 24 

(D) OTHER INFORMATION: /label= ambiguous 
/note= "Xaa = probably Tyr" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 50: 

Xaa Ile Xaa Val Xaa Ile Glu Asp Glu Thr Ala Xaa Val Gln Ala Gly
1 5 10 15

Ala Thr Leu Gly Glu Val Tyr Xaa
20



(2) INFORMATION FOR SEQ ID NO: 51: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(iii) HYPOTHETICAL: NO

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Lactuca sativa 

(B) STRAIN: lollo bionda 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 51: 

Ala Asp Pro Ser Phe Pro Leu Ser Gly Gln Leu Tyr Tyr Pro
1 5 10



(2) INFORMATION FOR SEQ ID NO: 52:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 32 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA
(iii) HYPOTHETICAL: NO 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 52: 
ACTTCTACTT CTATTATTGA TAGGTTTACT CA 



(2) INFORMATION FOR SEQ ID NO: 53: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 405 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: cDNA to mRNA 
(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Lactuca sativa 

(B) STRAIN: lollo bionda 

(ix) FEATURE: 

(A) NAME/KEY: CDS

(B) LOCATION: 1..405 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 53: 



ACT TCT ACT TCT ATT ATT GAT AGG TTT ACT CAA TGT CTA AAC AAC CGA 48
Thr Ser Thr Ser Ile Ile Asp Arg Phe Thr Gln Cys Leu Asn Asn Arg
1 5 10 15

GCT GAC CCT TCT TTC CCG CTC AGT GGA CAA CTT TAC ACT CCC GAT AAC 96
Ala Asp Pro Ser Phe Pro Leu Ser Gly Gln Leu Tyr Thr Pro Asp Asn
20 25 30

TCC TCT TTT CCA TCC GTC TTG CAA GCT TAC ATC CGG AAC CTC CGA TTC 144
Ser Ser Phe Pro Ser Val Leu Gln Ala Tyr Ile Arg Asn Leu Arg Phe
35 40 45

AAT GAA TCC ACG ACT CCC AAA CCC ATC TTA ATC ATC ACC GCC TTA CAC 192
Asn Glu Ser Thr Thr Pro Lys Pro Ile Leu Ile Ile Thr Ala Leu His
50 55 60

CCT TCA CAC ATT CAA GCA GCT GTT GTG TGC GCC AAA ACA CAC CGC CTG 240
Pro Ser His Ile Gln Ala Ala Val Val Cys Ala Lys Thr His Arg Leu
65 70 75 80

CTA ATG AAA ACC AGA AGC GGA GGC CAT GAT TAT GAG GGG CTT TCC TAT 288
Leu Met Lys Thr Arg Ser Gly Gly His Asp Tyr Glu Gly Leu Ser Tyr
85 90 95

GTG ACC AAT TCG AAC CAA CCC TTT TTT GTT GTT GAC ATG TTC AAC TTA 336
Val Thr Asn Ser Asn Gln Pro Phe Phe Val Val Asp Met Phe Asn Leu
100 105 110

CGC TCC ATA AAC GTG AGT ATT GAA GAT GAA ACT GCA TGG GTC CAA GCC 384
Arg Ser Ile Asn Val Ser Ile Glu Asp Glu Thr Ala Trp Val Gln Ala
115 120 125

GGC GCC ACC CTC GGA GAA GTT 405
Gly Ala Thr Leu Gly Glu Val
130 135

(2) INFORMATION FOR SEQ ID NO: 54: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 135 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 54: 

Thr Ser Thr Ser Ile Ile Asp Arg Phe Thr Gln Cys Leu Asn Asn Arg
1 5 10 15

Ala Asp Pro Ser Phe Pro Leu Ser Gly Gln Leu Tyr Thr Pro Asp Asn
20 25 30

Ser Ser Phe Pro Ser Val Leu Gln Ala Tyr Ile Arg Asn Leu Arg Phe
35 40 45

Asn Glu Ser Thr Thr Pro Lys Pro Ile Leu Ile Ile Thr Ala Leu His
50 55 60

Pro Ser His Ile Gln Ala Ala Val Val Cys Ala Lys Thr His Arg Leu
65 70 75 80

Leu Met Lys Thr Arg Ser Gly Gly His Asp Tyr Glu Gly Leu Ser Tyr
85 90 95

Val Thr Asn Ser Asn Gln Pro Phe Phe Val Val Asp Met Phe Asn Leu
100 105 110

Arg Ser Ile Asn Val Ser Ile Glu Asp Glu Thr Ala Trp Val Gln Ala
115 120 125

Gly Ala Thr Leu Gly Glu Val
130 135



(2) INFORMATION FOR SEQ ID NO: 55: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 55: 
CACGTTTATG GAGCGTAAGT TGAAC 



(2) INFORMATION FOR SEQ ID NO: 56: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 56: 

CACCCTTCAC ACATTCAAGC AGC 



(2) INFORMATION FOR SEQ ID NO: 57: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1981 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: cDNA to mRNA 
(iii) HYPOTHETICAL: NO 



(iii) ANTI-SENSE: NO



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Lactuca sativa 

(B) STRAIN: lollo bionda 

(ix) FEATURE: 

(A) NAME/KEY: CDS

(B) LOCATION: 7..1626

(ix) FEATURE: 

(A) NAME/KEY: unsure 

(B) LOCATION: 372 

(D) OTHER INFORMATION: location 372 may be "C" or "G" 

(ix) FEATURE: 

(A) NAME/KEY: unsure 

(B) LOCATION: 379

(D) OTHER INFORMATION: location 379 may be "A" or "G" 

(ix) FEATURE: 

(A) NAME/KEY: unsure

(B) LOCATION: 786

(D) OTHER INFORMATION: location 786 may be "C" or "T"

(ix) FEATURE:

(A) NAME/KEY: unsure

(B) LOCATION: 1105..1106

(D) OTHER INFORMATION: location 1105..1106 may be "AG",
"GA", "GG" or "AA"



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 57: 

ACAAAA ATG GCA ATT ACC TAT TCT TTC AAC TTC AAA TCT TAT ATT TTT 48
Met Ala Ile Thr Tyr Ser Phe Asn Phe Lys Ser Tyr Ile Phe
1 5 10

CCT CTC CTC CTT GTC TTG CTC TCT ACC CAT TCA TCA GCG ACT TCA ACT 96
Pro Leu Leu Leu Val Leu Leu Ser Thr His Ser Ser Ala Thr Ser Thr
15 20 25 30

TCC ATT ATA GAT CGC TTC ACC CAA TGT CTA AAC AAC CGA GCT GAC CCT 144
Ser Ile Ile Asp Arg Phe Thr Gln Cys Leu Asn Asn Arg Ala Asp Pro
35 40 45

TCT TTC CCG CTC AGT GGA CAA CTT TAC ACT CCC GAT AAC TCC TCT TTT 192
Ser Phe Pro Leu Ser Gly Gln Leu Tyr Thr Pro Asp Asn Ser Ser Phe
50 55 60

CCA TCC GTC TTG CAA GCT TAC ATC CGG AAC CTC CGA TTC AAT GAA TCC 240
Pro Ser Val Leu Gln Ala Tyr Ile Arg Asn Leu Arg Phe Asn Glu Ser
65 70 75

ACG ACT CCC AAA CCC ATC TTA ATC ATC ACC GCC TTA CAC CCT TCA CAC 288
Thr Thr Pro Lys Pro Ile Leu Ile Ile Thr Ala Leu His Pro Ser His
80 85 90

ATT CAA GCA GCT GTT GTG TGC GCC AAA ACA CAC CGC CTG CTA ATG AAA 336
Ile Gln Ala Ala Val Val Cys Ala Lys Thr His Arg Leu Leu Met Lys
95 100 105 110



ACC AGA AGC GGA GGC CAT GAT TAT GAG GGG CTT TCS TAT GTG RCC AAT 384
Thr Arg Ser Gly Gly His Asp Tyr Glu Gly Leu Ser Tyr Val Xaa Asn
115 120 125

TCG AAC CAA CCC TTT TTT GTT GTT GAC ATG TTC AAC TTA CGC TCC ATA 432
Ser Asn Gln Pro Phe Phe Val Val Asp Met Phe Asn Leu Arg Ser Ile
130 135 140

AAC GTG AGT ATT GAA GAT GAA ACT GCA TGG GTC CAA GCT GGT GCG ACT 480
Asn Val Ser Ile Glu Asp Glu Thr Ala Trp Val Gln Ala Gly Ala Thr
145 150 155

CTT GGT GAA GTC TAC TAC CGA ATA GCA GAG AAA AGC AAC AGT CAT GCT 528
Leu Gly Glu Val Tyr Tyr Arg Ile Ala Glu Lys Ser Asn Ser His Ala
160 165 170

TTT CCG GCT GGC GTT TGC CCT ACT GTT GGA GTT GGT GGC CAT TTT AGT 576
Phe Pro Ala Gly Val Cys Pro Thr Val Gly Val Gly Gly His Phe Ser
175 180 185 190

GGT GGT GGT TAT GGT AAC TTG ATG GGA AAA TAC GGC CTT TCT GTT GAC 624
Gly Gly Gly Tyr Gly Asn Leu Met Gly Lys Tyr Gly Leu Ser Val Asp
195 200 205

AAT ATT GTC GAT GCT CAG TTA ATC GAT GTG AAT GGT AAA CTT CTG AAT 672
Asn Ile Val Asp Ala Gln Leu Ile Asp Val Asn Gly Lys Leu Leu Asn
210 215 220

CGG AAA TCA ATG GGT GAA GAT CTT TTT TGG GCC ATC ACA GGT GGT GGT 720
Arg Lys Ser Met Gly Glu Asp Leu Phe Trp Ala Ile Thr Gly Gly Gly
225 230 235

GGT GTC AGC TTT GGT GTG GTT GTA GCG TAC AAG ATC AAA CTG GTT CGT 768
Gly Val Ser Phe Gly Val Val Val Ala Tyr Lys Ile Lys Leu Val Arg
240 245 250

GTT CCT ACC ACT GTG ACY GTT TTT AAC GTA CAA AGA ACA TCC GAG CAG 816
Val Pro Thr Thr Val Thr Val Phe Asn Val Gln Arg Thr Ser Glu Gln
255 260 265 270

AAC CTA AGC ACC ATA GCC CAC CGA TGG ATA CAA GTT GCG GAT AAG CTC 864
Asn Leu Ser Thr Ile Ala His Arg Trp Ile Gln Val Ala Asp Lys Leu
275 280 285

GAT AAT GAC CTT TTC CTT CGA ATG ACC TTT AAC GTG ATA AAC AAC ACA 912
Asp Asn Asp Leu Phe Leu Arg Met Thr Phe Asn Val Ile Asn Asn Thr
290 295 300

AAT GGC GAA AAG ACG ATA CGT GGT TTG TTT CCA ACA CTG TAC CTC GGA 960
Asn Gly Glu Lys Thr Ile Arg Gly Leu Phe Pro Thr Leu Tyr Leu Gly
305 310 315

AAC TCT ACC GCT CTT GTT GCC CTC CTG AAC AAG GAT TTC CCT GAA TTA 1008
Asn Ser Thr Ala Leu Val Ala Leu Leu Asn Lys Asp Phe Pro Glu Leu
320 325 330

GGT GTA GAA ATT TCA GAT TGT ATT GAA ATG AGT TGG ATC GAG TCT GTT 1056
Gly Val Glu Ile Ser Asp Cys Ile Glu Met Ser Trp Ile Glu Ser Val
335 340 345 350



CTT TTC TAC ACA AAC TTC CCC ATT GGT ACT CCG ACC ACT GCT CTT CTA 1104
Leu Phe Tyr Thr Asn Phe Pro Ile Gly Thr Pro Thr Thr Ala Leu Leu
355 360 365

RRC CGT ACA CCT CAA AGA CTA AAC CCA TTC AAA ATC AAA TCT GAT TAC 1152
Xaa Arg Thr Pro Gln Arg Leu Asn Pro Phe Lys Ile Lys Ser Asp Tyr
370 375 380

GTA AAA AAC ACT ATT TCC AAA CAG GGA TTC GAA TCC ATA TTT GAA AGG 1200
Val Lys Asn Thr Ile Ser Lys Gln Gly Phe Glu Ser Ile Phe Glu Arg
385 390 395

ATG AAA GAA CTC GAA AAC CAA ATG CTA GCT TTC AAC CCT TAT GGT GGA 1248
Met Lys Glu Leu Glu Asn Gln Met Leu Ala Phe Asn Pro Tyr Gly Gly
400 405 410

AGA ATG AGC GAA ATT TCC GAA TTT GCA AAG CCT TTT CCC CAT CGA TCA 1296
Arg Met Ser Glu Ile Ser Glu Phe Ala Lys Pro Phe Pro His Arg Ser
415 420 425 430

GGG AAT ATA GCG AAG ATC CAA TAC GAA GTA AAC TGG GAT GAA CTT GGC 1344
Gly Asn Ile Ala Lys Ile Gln Tyr Glu Val Asn Trp Asp Glu Leu Gly
435 440 445

GTT GAA GCA GCC AAT CGG TAC TTG AAC TTC ACA AGG GTG ATG TAT GAT 1392
Val Glu Ala Ala Asn Arg Tyr Leu Asn Phe Thr Arg Val Met Tyr Asp
450 455 460

TAT ATG ACT CCG TTT GTT TCT AAG AAC CCC AGG GAA GCA TTT CTG AAC 1440
Tyr Met Thr Pro Phe Val Ser Lys Asn Pro Arg Glu Ala Phe Leu Asn
465 470 475

TAC AGG GAT TTA GAT ATT GGT GTC AAC AGT CAT GGC AAG AAT GCT TAC 1488
Tyr Arg Asp Leu Asp Ile Gly Val Asn Ser His Gly Lys Asn Ala Tyr
480 485 490

GGT GAA GGA ATG GTT TAT GGG CAC AAG TAT TTC AAA GAG ACG AAT TAT 1536
Gly Glu Gly Met Val Tyr Gly His Lys Tyr Phe Lys Glu Thr Asn Tyr
495 500 505 510

AAG AGG CTA ACG ATG GTG AAG ACG AGG GTT GAT CCT AGC AAT TTT TTT 1584
Lys Arg Leu Thr Met Val Lys Thr Arg Val Asp Pro Ser Asn Phe Phe
515 520 525

AGG AAT GAG CAA AGT ATC CCA ACT TTG TCA TCT TCA TGG AAG 1626
Arg Asn Glu Gln Ser Ile Pro Thr Leu Ser Ser Ser Trp Lys
530 535 540

TAAATTCTAA ATTCACTTGT GAAATTGAAT AAAAGTATGG CTTTTTCAAG GTCATGGTAT 1686 

CCAGATTCAG ATGATATTGA TATAATTTTG ACTTGTATTT ATACAAACAA AATTATATTA 1746

TATTTTTCTG AATTTAGATT TTCCATTCTT TGGAAAAATA TACGAACATT GATGTTGATA 1806

TTTTTAAGAA TTATAGATTT TGAACATTGT GAACAATGAA TAAACCGAGG ACTTCCCTTG 1866 

GGTTTTTTTT ATAAGTATGT AATAGCATGT CTTTAATCAA GATAACCGAT CATTGGATGC 1926 

AATTTATTAT TATAAACCTT ATTTAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAA 1981 



(2) INFORMATION FOR SEQ ID NO: 58:



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 540 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 58: 

Met Ala Ile Thr Tyr Ser Phe Asn Phe Lys Ser Tyr Ile Phe Pro Leu
1 5 10 15

Leu Leu Val Leu Leu Ser Thr His Ser Ser Ala Thr Ser Thr Ser Ile
20 25 30

Ile Asp Arg Phe Thr Gln Cys Leu Asn Asn Arg Ala Asp Pro Ser Phe
35 40 45

Pro Leu Ser Gly Gln Leu Tyr Thr Pro Asp Asn Ser Ser Phe Pro Ser
50 55 60

Val Leu Gln Ala Tyr Ile Arg Asn Leu Arg Phe Asn Glu Ser Thr Thr
65 70 75 80

Pro Lys Pro Ile Leu Ile Ile Thr Ala Leu His Pro Ser His Ile Gln
85 90 95

Ala Ala Val Val Cys Ala Lys Thr His Arg Leu Leu Met Lys Thr Arg 
100 105 110 

Ser Gly Gly His Asp Tyr Glu Gly Leu Ser Tyr Val Thr Asn Ser Asn 
115 120 125 

Gin Pro Phe Phe Val Val Asp Met Phe Asn Leu Arg Ser lie Asn Val 
130 135 140 

Ser lie Glu Asp Glu Thr Ala Trp Val Gin Ala Gly Ala Thr Leu Gly 
145 150 155 160 

Glu Val Tyr Tyr Arg lie Ala Glu Lys Ser Asn Ser His Ala Phe Pro 
165 170 175 

Ala Gly Val Cys Pro Thr Val Gly Val Gly Gly His Phe Ser Gly Gly 
180 185 190 

Gly Tyr Gly Asn Leu Met Gly Lys Tyr Gly Leu Ser Val Asp Asn lie 
195 200 205 

Val Asp Ala Gin Leu lie Asp Val Asn Gly Lys Leu Leu Asn Arg Lys 
210 215 220 

Ser Met Gly Glu Asp Leu Phe Trp Ala lie Thr Gly Gly Gly Gly Val 
225 230 235 240 

Ser Phe Gly Val Val Val Ala Tyr Lys lie Lys Leu Val Arg Val Pro 
245 250 255 

Thr Thr Val Thr Val Phe Asn Val Gin Arg Thr Ser Glu Gin Asn Leu 
260 265 270 



Ser Thr Ile Ala His Arg Trp Ile Gln Val Ala Asp Lys Leu Asp Asn
275 280 285

Asp Leu Phe Leu Arg Met Thr Phe Asn Val Ile Asn Asn Thr Asn Gly
290 295 300

Glu Lys Thr Ile Arg Gly Leu Phe Pro Thr Leu Tyr Leu Gly Asn Ser
305 310 315 320

Thr Ala Leu Val Ala Leu Leu Asn Lys Asp Phe Pro Glu Leu Gly Val
325 330 335

Glu Ile Ser Asp Cys Ile Glu Met Ser Trp Ile Glu Ser Val Leu Phe
340 345 350

Tyr Thr Asn Phe Pro Ile Gly Thr Pro Thr Thr Ala Leu Leu Ser Arg
355 360 365

Thr Pro Gln Arg Leu Asn Pro Phe Lys Ile Lys Ser Asp Tyr Val Lys
370 375 380

Asn Thr Ile Ser Lys Gln Gly Phe Glu Ser Ile Phe Glu Arg Met Lys
385 390 395 400

Glu Leu Glu Asn Gln Met Leu Ala Phe Asn Pro Tyr Gly Gly Arg Met
405 410 415

Ser Glu Ile Ser Glu Phe Ala Lys Pro Phe Pro His Arg Ser Gly Asn
420 425 430

Ile Ala Lys Ile Gln Tyr Glu Val Asn Trp Asp Glu Leu Gly Val Glu
435 440 445

Ala Ala Asn Arg Tyr Leu Asn Phe Thr Arg Val Met Tyr Asp Tyr Met
450 455 460

Thr Pro Phe Val Ser Lys Asn Pro Arg Glu Ala Phe Leu Asn Tyr Arg
465 470 475 480

Asp Leu Asp Ile Gly Val Asn Ser His Gly Lys Asn Ala Tyr Gly Glu
485 490 495

Gly Met Val Tyr Gly His Lys Tyr Phe Lys Glu Thr Asn Tyr Lys Arg
500 505 510

Leu Thr Met Val Lys Thr Arg Val Asp Pro Ser Asn Phe Phe Arg Asn
515 520 525

Glu Gln Ser Ile Pro Thr Leu Ser Ser Ser Trp Lys
530 535 540



(2) INFORMATION FOR SEQ ID NO: 59: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear



(ii) MOLECULE TYPE: cDNA 



(iii) HYPOTHETICAL: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 59: 
GGTAATGATC TCCTTTCTTG TTTGACC 



(2) INFORMATION FOR SEQ ID NO: 60:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 41 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 60: 
AGAGCGGCCG CTATATTACA ACTTCTCCAC CATCACTCCT C 



(2) INFORMATION FOR SEQ ID NO: 61:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 61: 
GGTGATGTTA ATGATAATCT CCTC 



(2) INFORMATION FOR SEQ ID NO: 62: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 40 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: cDNA
(iii) HYPOTHETICAL: NO
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 62:
AGAGCGGCCG CTACAATTCC TTCAACATGT AAATTTCCTC 



(2) INFORMATION FOR SEQ ID NO: 63: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 36 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: cDNA
(iii) HYPOTHETICAL: NO
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 63:
ACTTCCCGTA GAAACTCGGA GACTTTCACA CAATGC 



(2) INFORMATION FOR SEQ ID NO: 64: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 64:
TCCATCCAAG ATCAATTCAT AAACTGTGTC 



(2) INFORMATION FOR SEQ ID NO: 65:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 36 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: cDNA
(iii) HYPOTHETICAL: NO
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 65:
AGAGCGGCCG CTTTCATGAA CCTAGCTTCT AGTAGG 



(2) INFORMATION FOR SEQ ID NO: 66:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 37 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: cDNA



(iii) HYPOTHETICAL: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 66: 



AGAGCGGCCG CGAAATGGCC CCCCTTTTAA AACGGGG 



(2) INFORMATION FOR SEQ ID NO: 67:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 40 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 67: 
AGAGCGGCCG CAAATGATAT CTTCAGGTAA CTTTGTTCAC 



(2) INFORMATION FOR SEQ ID NO: 68:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 43 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 68: 
AGAGCGGCCG CATAATCAAA TAAATACACT TATGGTAACA CAG 



(2) INFORMATION FOR SEQ ID NO: 69:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 38 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: cDNA
(iii) HYPOTHETICAL: NO
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 69:
AGAGCGGCCG CTGGTTTTGT ATTGAGGACT CAAAACAG 



(2) INFORMATION FOR SEQ ID NO: 70: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1757 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)
(iii) HYPOTHETICAL: NO

(iii) ANTI-SENSE: NO

(vi) ORIGINAL SOURCE:

(A) ORGANISM: Arabidopsis thaliana

(B) STRAIN: Colombia

(ix) FEATURE:

(A) NAME/KEY: CDS

(B) LOCATION: join(1..570, 801..1754)
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 70: 



ACT TCC CGT AGA AAC TCG GAG ACT TTC ACA CAA TGC CTA ACC TCA AAC 48
Thr Ser Arg Arg Asn Ser Glu Thr Phe Thr Gln Cys Leu Thr Ser Asn
1 5 10 15

TCC GAC CCC AAA CAT CCC ATC TCC CCC GCT ATC TTC TTC TCC GGA AAT 96
Ser Asp Pro Lys His Pro Ile Ser Pro Ala Ile Phe Phe Ser Gly Asn
20 25 30

GGC TCC TAC TCC TCC GTA TTA CAA GCC AAC ATC CGT AAC CTC CGC TTC 144
Gly Ser Tyr Ser Ser Val Leu Gln Ala Asn Ile Arg Asn Leu Arg Phe
35 40 45

AAC ACC ACC TCA ACT CCG AAA CCC TTC CTC ATA ATC GCC GCA ACA CAT 192
Asn Thr Thr Ser Thr Pro Lys Pro Phe Leu Ile Ile Ala Ala Thr His
50 55 60

GAA TCC CAT GTG CAA GCC GCG ATT ACT TGC GGG AAA CGC CAC AAC CTT 240
Glu Ser His Val Gln Ala Ala Ile Thr Cys Gly Lys Arg His Asn Leu
65 70 75 80

CAG ATG AAA ATC AGA AGT GGA GGC CAC GAC TAC GAT GGC TTG TCA TAC 288
Gln Met Lys Ile Arg Ser Gly Gly His Asp Tyr Asp Gly Leu Ser Tyr
85 90 95

GTT ACA TAC TCT GGC AAA CCG TTC TTC GTC CTC GAC ATG TTT AAC CTC 336
Val Thr Tyr Ser Gly Lys Pro Phe Phe Val Leu Asp Met Phe Asn Leu
100 105 110

CGT TCG GTG GAT GTC GAT GTG GCA AGT AAG ACC GCG TGG GTC CAA ACC 384
Arg Ser Val Asp Val Asp Val Ala Ser Lys Thr Ala Trp Val Gln Thr
115 120 125

GGT GCC ATA CTC GGA GAA GTT TAT TAC TAT ATA TGG GAG AAG AGC AAA 432
Gly Ala Ile Leu Gly Glu Val Tyr Tyr Tyr Ile Trp Glu Lys Ser Lys
130 135 140

ACC CTA GCT TAT CCC GCC GGA ATT TGT CCC ACG GTT GGT GTC GGT GGC 480
Thr Leu Ala Tyr Pro Ala Gly Ile Cys Pro Thr Val Gly Val Gly Gly
145 150 155 160

CAT ATC AGT GGT GGA GGT TAC GGT AAC ATG ATG AGA AAA TAC GGT CTC 528
His Ile Ser Gly Gly Gly Tyr Gly Asn Met Met Arg Lys Tyr Gly Leu
165 170 175

ACC GTA GAT AAT ACC ATC GAT GCA AGA ATG GTC GAC GTT AAT 570
Thr Val Asp Asn Thr Ile Asp Ala Arg Met Val Asp Val Asn
180 185 190

GGTATAATTG ATATCTCTAT TTTATATACT AATTAAATTT TATAGTGTGG ATCGGATAGT 630

GATTTTGGTC CATCAATTAA AAACTTGGTG AACATAAAAT TAACCAAGCA ATCAATTTAG 690

ACAAGCAACA TAATCATATA TATTTTTCTT ACATTTGTAT GTACCTGAAT ATTTATATTT 750

ATGTTTATAT GTTCTCACTA TATTTTCACT TTTGTATTTG AAAATTTTTA GGA AAA 806
Gly Lys

ATT TTG GAT AGA AAA TTG ATG GGA GAA GAT CTC TAC TGG GCA ATA AAC 854
Ile Leu Asp Arg Lys Leu Met Gly Glu Asp Leu Tyr Trp Ala Ile Asn
195 200 205

GGA GGA GGA GGA GGG AGC TAC GGC GTC GTA TTG GCC TAC AAA ATA AAC 902
Gly Gly Gly Gly Gly Ser Tyr Gly Val Val Leu Ala Tyr Lys Ile Asn
210 215 220

CTT GTT GAA GTC CCA GAA AAC GTC ACC GTT TTC AGA ATC TCC CGG ACG 950
Leu Val Glu Val Pro Glu Asn Val Thr Val Phe Arg Ile Ser Arg Thr
225 230 235 240

TTA GAA CAA AAT GCG ACG GAT ATC ATT CAC CGG TGG CAA CAA GTT GCA 998
Leu Glu Gln Asn Ala Thr Asp Ile Ile His Arg Trp Gln Gln Val Ala
245 250 255

CCG AAG CTT CCC GAC GAG CTT TTC ATA AGA ACA GTC ATT GAC GTA GTA 1046
Pro Lys Leu Pro Asp Glu Leu Phe Ile Arg Thr Val Ile Asp Val Val
260 265 270

AAC GGC ACT GTT TCA TCT CAA AAG ACC GTC AGG ACA ACA TTC ATA GCA 1094
Asn Gly Thr Val Ser Ser Gln Lys Thr Val Arg Thr Thr Phe Ile Ala
275 280 285

ATG TTT CTA GGA GAC ACG ACA ACT CTA CTG TCG ATA TTA AAC CGG AGA 1142
Met Phe Leu Gly Asp Thr Thr Thr Leu Leu Ser Ile Leu Asn Arg Arg
290 295 300

TTC CCA GAA TTG GGT TTG GTC CGG TCT GAC TGT ACC GAA ACA AGC TGG 1190
Phe Pro Glu Leu Gly Leu Val Arg Ser Asp Cys Thr Glu Thr Ser Trp
305 310 315 320

ATC CAA TCT GTG CTA TTC TGG ACA AAT ATC CAA GTT GGT TCG TCG GAG 1238
Ile Gln Ser Val Leu Phe Trp Thr Asn Ile Gln Val Gly Ser Ser Glu
325 330 335

ACA CTT CTA CTC CAA AGG AAT CAA CCC GTG AAC TAC CTC AAG AGG AAA 1286
Thr Leu Leu Leu Gln Arg Asn Gln Pro Val Asn Tyr Leu Lys Arg Lys
340 345 350

TCA GAT TAC GTA CGT GAA CCG ATT TCA AGA ACC GGT TTA GAG TCA ATT 1334
Ser Asp Tyr Val Arg Glu Pro Ile Ser Arg Thr Gly Leu Glu Ser Ile
355 360 365



TGG AAG AAA ATG ATC GAG CTT GAA ATT CCG ACA ATG GCT TTC AAT CCA 1382
Trp Lys Lys Met Ile Glu Leu Glu Ile Pro Thr Met Ala Phe Asn Pro
370 375 380

TAC GGT GGT GAG ATG GGG AGG ATA TCA TTA CGG GTG ACT CCG TTC CCA 1430
Tyr Gly Gly Glu Met Gly Arg Ile Ser Leu Arg Val Thr Pro Phe Pro
385 390 395 400

TAC AGA GCC GGT AAT CTC TGG AAG ATT CAG TAC GGT GCG AAT TGG AGA 1478
Tyr Arg Ala Gly Asn Leu Trp Lys Ile Gln Tyr Gly Ala Asn Trp Arg
405 410 415

GAT GAG ACT TTA ACC GAC CGG TAC ATG GAA TTG ACG AGG AAG TTG TAC 1526
Asp Glu Thr Leu Thr Asp Arg Tyr Met Glu Leu Thr Arg Lys Leu Tyr
420 425 430

CAA TTC ATG ACA CCA TTT GTT TCC AAG AAT CCG AGA CAA TCG TTT TTC 1574
Gln Phe Met Thr Pro Phe Val Ser Lys Asn Pro Arg Gln Ser Phe Phe
435 440 445

AAT AAC CGT GAT GTT GAT TTG GGT ATT AAT TCT CAT AAT GGT AAA ATC 1622
Asn Asn Arg Asp Val Asp Leu Gly Ile Asn Ser His Asn Gly Lys Ile
450 455 460

AGT AGT TAT GTG GAA GGT AAA CGT TAC GGG AAG AAG TAT TTC GCA GGT 1670
Ser Ser Tyr Val Glu Gly Lys Arg Tyr Gly Lys Lys Tyr Phe Ala Gly
465 470 475 480

AAT TTC GAG AGA TTG GTG AAG ATT AAG ACG AGA GTT GAT AGT GGT AAT 1718
Asn Phe Glu Arg Leu Val Lys Ile Lys Thr Arg Val Asp Ser Gly Asn
485 490 495

TTC TTT AGG AAC GAA CAC AGT ATT CCT GTG TTA CCA TAA 1757
Phe Phe Arg Asn Glu His Ser Ile Pro Val Leu Pro
500 505

(2) INFORMATION FOR SEQ ID NO: 71: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 508 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 71:

Thr Ser Arg Arg Asn Ser Glu Thr Phe Thr Gln Cys Leu Thr Ser Asn
1 5 10 15

Ser Asp Pro Lys His Pro Ile Ser Pro Ala Ile Phe Phe Ser Gly Asn
20 25 30

Gly Ser Tyr Ser Ser Val Leu Gln Ala Asn Ile Arg Asn Leu Arg Phe
35 40 45

Asn Thr Thr Ser Thr Pro Lys Pro Phe Leu Ile Ile Ala Ala Thr His
50 55 60



Glu Ser His Val Gln Ala Ala Ile Thr Cys Gly Lys Arg His Asn Leu
65 70 75 80

Gln Met Lys Ile Arg Ser Gly Gly His Asp Tyr Asp Gly Leu Ser Tyr
85 90 95

Val Thr Tyr Ser Gly Lys Pro Phe Phe Val Leu Asp Met Phe Asn Leu
100 105 110

Arg Ser Val Asp Val Asp Val Ala Ser Lys Thr Ala Trp Val Gln Thr
115 120 125

Gly Ala Ile Leu Gly Glu Val Tyr Tyr Tyr Ile Trp Glu Lys Ser Lys
130 135 140

Thr Leu Ala Tyr Pro Ala Gly Ile Cys Pro Thr Val Gly Val Gly Gly
145 150 155 160

His Ile Ser Gly Gly Gly Tyr Gly Asn Met Met Arg Lys Tyr Gly Leu
165 170 175

Thr Val Asp Asn Thr Ile Asp Ala Arg Met Val Asp Val Asn Gly Lys
180 185 190

Ile Leu Asp Arg Lys Leu Met Gly Glu Asp Leu Tyr Trp Ala Ile Asn
195 200 205

Gly Gly Gly Gly Gly Ser Tyr Gly Val Val Leu Ala Tyr Lys Ile Asn
210 215 220

Leu Val Glu Val Pro Glu Asn Val Thr Val Phe Arg Ile Ser Arg Thr
225 230 235 240

Leu Glu Gln Asn Ala Thr Asp Ile Ile His Arg Trp Gln Gln Val Ala
245 250 255

Pro Lys Leu Pro Asp Glu Leu Phe Ile Arg Thr Val Ile Asp Val Val
260 265 270

Asn Gly Thr Val Ser Ser Gln Lys Thr Val Arg Thr Thr Phe Ile Ala
275 280 285

Met Phe Leu Gly Asp Thr Thr Thr Leu Leu Ser Ile Leu Asn Arg Arg
290 295 300

Phe Pro Glu Leu Gly Leu Val Arg Ser Asp Cys Thr Glu Thr Ser Trp
305 310 315 320

Ile Gln Ser Val Leu Phe Trp Thr Asn Ile Gln Val Gly Ser Ser Glu
325 330 335

Thr Leu Leu Leu Gln Arg Asn Gln Pro Val Asn Tyr Leu Lys Arg Lys
340 345 350

Ser Asp Tyr Val Arg Glu Pro Ile Ser Arg Thr Gly Leu Glu Ser Ile
355 360 365

Trp Lys Lys Met Ile Glu Leu Glu Ile Pro Thr Met Ala Phe Asn Pro
370 375 380



Tyr Gly Gly Glu Met Gly Arg Ile Ser Leu Arg Val Thr Pro Phe Pro
385 390 395 400

Tyr Arg Ala Gly Asn Leu Trp Lys Ile Gln Tyr Gly Ala Asn Trp Arg
405 410 415

Asp Glu Thr Leu Thr Asp Arg Tyr Met Glu Leu Thr Arg Lys Leu Tyr
420 425 430

Gln Phe Met Thr Pro Phe Val Ser Lys Asn Pro Arg Gln Ser Phe Phe
435 440 445

Asn Asn Arg Asp Val Asp Leu Gly Ile Asn Ser His Asn Gly Lys Ile
450 455 460

Ser Ser Tyr Val Glu Gly Lys Arg Tyr Gly Lys Lys Tyr Phe Ala Gly
465 470 475 480

Asn Phe Glu Arg Leu Val Lys Ile Lys Thr Arg Val Asp Ser Gly Asn
485 490 495

Phe Phe Arg Asn Glu His Ser Ile Pro Val Leu Pro
500 505



(2) INFORMATION FOR SEQ ID NO: 72: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1527 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA to mRNA 
(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Arabidopsis thaliana 

(B) STRAIN: Colombia 

(ix) FEATURE: 

(A) NAME/KEY: CDS

(B) LOCATION: 1..1524 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 72: 

ACT TCC CGT AGA AAC TCG GAG ACT TTC ACA CAA TGC CTA ACC TCA AAC 48
Thr Ser Arg Arg Asn Ser Glu Thr Phe Thr Gln Cys Leu Thr Ser Asn
1 5 10 15

TCC GAC CCC AAA CAT CCC ATC TCC CCC GCT ATC TTC TTC TCC GGA AAT 96
Ser Asp Pro Lys His Pro Ile Ser Pro Ala Ile Phe Phe Ser Gly Asn
20 25 30

GGC TCC TAC TCC TCC GTA TTA CAA GCC AAC ATC CGT AAC CTC CGC TTC 144
Gly Ser Tyr Ser Ser Val Leu Gln Ala Asn Ile Arg Asn Leu Arg Phe
35 40 45

AAC ACC ACC TCA ACT CCG AAA CCC TTC CTC ATA ATC GCC GCA ACA CAT 192
Asn Thr Thr Ser Thr Pro Lys Pro Phe Leu Ile Ile Ala Ala Thr His
50 55 60

GAA TCC CAT GTG CAA GCC GCG ATT ACT TGC GGG AAA CGC CAC AAC CTT 240
Glu Ser His Val Gln Ala Ala Ile Thr Cys Gly Lys Arg His Asn Leu
65 70 75 80

CAG ATG AAA ATC AGA AGT GGA GGC CAC GAC TAC GAT GGC TTG TCA TAC 288
Gln Met Lys Ile Arg Ser Gly Gly His Asp Tyr Asp Gly Leu Ser Tyr
85 90 95

GTT ACA TAC TCT GGC AAA CCG TTC TTC GTC CTC GAC ATG TTT AAC CTC 336
Val Thr Tyr Ser Gly Lys Pro Phe Phe Val Leu Asp Met Phe Asn Leu
100 105 110

CGT TCG GTG GAT GTC GAC GTG GCA AGT AAG ACC GCG TGG GTC CAA ACC 384
Arg Ser Val Asp Val Asp Val Ala Ser Lys Thr Ala Trp Val Gln Thr
115 120 125

GGT GCC ATA CTC GGA GAA GTT TAT TAC TAT ATA TGG GAG AAG AGC AAA 432
Gly Ala Ile Leu Gly Glu Val Tyr Tyr Tyr Ile Trp Glu Lys Ser Lys
130 135 140

ACC CTA GCT TAT CCC GCC GGA ATT TGT CCC ACG GTT GGT GTC GGT GGC 480
Thr Leu Ala Tyr Pro Ala Gly Ile Cys Pro Thr Val Gly Val Gly Gly
145 150 155 160

CAT ATC AGT GGT GGA GGT TAC GGT AAC ATG ATG AGA AAA TAC GGT CTC 528
His Ile Ser Gly Gly Gly Tyr Gly Asn Met Met Arg Lys Tyr Gly Leu
165 170 175

ACC GTA GAT AAT ACC ATC GAT GCA AGA ATG GTC GAC GTA AAT GGA AAA 576
Thr Val Asp Asn Thr Ile Asp Ala Arg Met Val Asp Val Asn Gly Lys
180 185 190

ATT TTG GAT AGA AAA TTG ATG GGA GAA GAT CTC TAC TGG GCA ATA AAC 624
Ile Leu Asp Arg Lys Leu Met Gly Glu Asp Leu Tyr Trp Ala Ile Asn
195 200 205

GGA GGA GGA GGA GGG AGC TAC GGC GTC GTA TTG GCC TAC AAA ATA AAC 672
Gly Gly Gly Gly Gly Ser Tyr Gly Val Val Leu Ala Tyr Lys Ile Asn
210 215 220

CTT GTT GAA GTC CCA GAA AAC GTC ACC GTT TTC AGA ATC TCC CGG ACG 720
Leu Val Glu Val Pro Glu Asn Val Thr Val Phe Arg Ile Ser Arg Thr
225 230 235 240

TTA GAA CAA AAT GCG ACG GAT ATC ATT CAC CGG TGG CAA CAA GTT GCA 768
Leu Glu Gln Asn Ala Thr Asp Ile Ile His Arg Trp Gln Gln Val Ala
245 250 255

CCG AAG CTT CCC GAC GAG CTT TTC ATA AGA ACA GTC ATT GAC GTA GTA 816
Pro Lys Leu Pro Asp Glu Leu Phe Ile Arg Thr Val Ile Asp Val Val
260 265 270

AAC GGC ACT GTT TCA TCT CAA AAG ACC GTC AGG ACA ACA TTC ATA GCA 864
Asn Gly Thr Val Ser Ser Gln Lys Thr Val Arg Thr Thr Phe Ile Ala
275 280 285



ATG TTT CTA GGA GAC ACG ACA ACT CTA CTG TCG ATA TTA AAC CGG AGA 912
Met Phe Leu Gly Asp Thr Thr Thr Leu Leu Ser Ile Leu Asn Arg Arg
290 295 300

TTC CCA GAA TTG GGT TTG GTC CGG TCT GAC TGT ACC GAA ACA AGC TGG 960
Phe Pro Glu Leu Gly Leu Val Arg Ser Asp Cys Thr Glu Thr Ser Trp
305 310 315 320

ATC CAA TCT GTG CTA TTC TGG ACA AAT ATC CAA GTT GGT TCG TCG GAG 1008
Ile Gln Ser Val Leu Phe Trp Thr Asn Ile Gln Val Gly Ser Ser Glu
325 330 335

ACA CTT CTA CTC CAA AGG AAT CAA CCC GTG AAC TAC CTC AAG AGG AAA 1056
Thr Leu Leu Leu Gln Arg Asn Gln Pro Val Asn Tyr Leu Lys Arg Lys
340 345 350

TCA GAT TAC GTA CGT GAA CCG ATT TCA AGA ACC GGT TTA GAG TCA ATT 1104
Ser Asp Tyr Val Arg Glu Pro Ile Ser Arg Thr Gly Leu Glu Ser Ile
355 360 365

TGG AAG AAA ATG ATC GAG CTT GAA ATT CCG ACA ATG GCT TTC AAT CCA 1152
Trp Lys Lys Met Ile Glu Leu Glu Ile Pro Thr Met Ala Phe Asn Pro
370 375 380

TAC GGT GGT GAG ATG GGG AGG ATA TCA TCT ACG GTG ACT CCG TTC CCA 1200
Tyr Gly Gly Glu Met Gly Arg Ile Ser Ser Thr Val Thr Pro Phe Pro
385 390 395 400

TAC AGA GCC GGT AAT CTC TGG AAG ATT CAG TAC GGT GCG AAT TGG AGA 1248
Tyr Arg Ala Gly Asn Leu Trp Lys Ile Gln Tyr Gly Ala Asn Trp Arg
405 410 415

GAT GAG ACT TTA ACC GAC CGG TAC ATG GAA TTG ACG AGG AAG TTG TAC 1296
Asp Glu Thr Leu Thr Asp Arg Tyr Met Glu Leu Thr Arg Lys Leu Tyr
420 425 430

CAA TTC ATG ACA CCA TTT GTT TCC AAG AAT CCG AGA CAA TCG TTT TTC 1344
Gln Phe Met Thr Pro Phe Val Ser Lys Asn Pro Arg Gln Ser Phe Phe
435 440 445

AAT TAC CGT GAT GTT GAT TTG GGT ATT AAT TCT CAT AAT GGT AAA ATC 1392
Asn Tyr Arg Asp Val Asp Leu Gly Ile Asn Ser His Asn Gly Lys Ile
450 455 460

AGT AGT TAT GTG GAA GGT AAA CGT TAC GGG AAG AAG TAT TTC GCA GGT 1440
Ser Ser Tyr Val Glu Gly Lys Arg Tyr Gly Lys Lys Tyr Phe Ala Gly 
465 470 475 480 

AAT TTC GAG AGA TTG GTG AAG ATT AAG ACG AGA GTT GAT AGT GGT AAT 1488
Asn Phe Glu Arg Leu Val Lys Ile Lys Thr Arg Val Asp Ser Gly Asn
485 490 495

TTC TTT AGG AAC GAA CAG AGT ATT CCT GTG TTA CCA TAA 1527
Phe Phe Arg Asn Glu Gln Ser Ile Pro Val Leu Pro
500 505



(2) INFORMATION FOR SEQ ID NO: 73:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 508 amino acids

(B) TYPE: amino acid

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 73: 

Thr Ser Arg Arg Asn Ser Glu Thr Phe Thr Gln Cys Leu Thr Ser Asn
1 5 10 15

Ser Asp Pro Lys His Pro Ile Ser Pro Ala Ile Phe Phe Ser Gly Asn
20 25 30

Gly Ser Tyr Ser Ser Val Leu Gln Ala Asn Ile Arg Asn Leu Arg Phe
35 40 45

Asn Thr Thr Ser Thr Pro Lys Pro Phe Leu Ile Ile Ala Ala Thr His
50 55 60

Glu Ser His Val Gln Ala Ala Ile Thr Cys Gly Lys Arg His Asn Leu
65 70 75 80

Gln Met Lys Ile Arg Ser Gly Gly His Asp Tyr Asp Gly Leu Ser Tyr
85 90 95

Val Thr Tyr Ser Gly Lys Pro Phe Phe Val Leu Asp Met Phe Asn Leu
100 105 110

Arg Ser Val Asp Val Asp Val Ala Ser Lys Thr Ala Trp Val Gln Thr
115 120 125

Gly Ala Ile Leu Gly Glu Val Tyr Tyr Tyr Ile Trp Glu Lys Ser Lys
130 135 140

Thr Leu Ala Tyr Pro Ala Gly Ile Cys Pro Thr Val Gly Val Gly Gly
145 150 155 160

His Ile Ser Gly Gly Gly Tyr Gly Asn Met Met Arg Lys Tyr Gly Leu
165 170 175

Thr Val Asp Asn Thr Ile Asp Ala Arg Met Val Asp Val Asn Gly Lys
180 185 190

Ile Leu Asp Arg Lys Leu Met Gly Glu Asp Leu Tyr Trp Ala Ile Asn
195 200 205

Gly Gly Gly Gly Gly Ser Tyr Gly Val Val Leu Ala Tyr Lys Ile Asn
210 215 220

Leu Val Glu Val Pro Glu Asn Val Thr Val Phe Arg Ile Ser Arg Thr
225 230 235 240

Leu Glu Gln Asn Ala Thr Asp Ile Ile His Arg Trp Gln Gln Val Ala
245 250 255

Pro Lys Leu Pro Asp Glu Leu Phe Ile Arg Thr Val Ile Asp Val Val
260 265 270

Asn Gly Thr Val Ser Ser Gln Lys Thr Val Arg Thr Thr Phe Ile Ala
275 280 285



Met Phe Leu Gly Asp Thr Thr Thr Leu Leu Ser Ile Leu Asn Arg Arg
290 295 300

Phe Pro Glu Leu Gly Leu Val Arg Ser Asp Cys Thr Glu Thr Ser Trp
305 310 315 320

Ile Gln Ser Val Leu Phe Trp Thr Asn Ile Gln Val Gly Ser Ser Glu
325 330 335

Thr Leu Leu Leu Gln Arg Asn Gln Pro Val Asn Tyr Leu Lys Arg Lys
340 345 350

Ser Asp Tyr Val Arg Glu Pro Ile Ser Arg Thr Gly Leu Glu Ser Ile
355 360 365

Trp Lys Lys Met Ile Glu Leu Glu Ile Pro Thr Met Ala Phe Asn Pro
370 375 380

Tyr Gly Gly Glu Met Gly Arg Ile Ser Ser Thr Val Thr Pro Phe Pro
385 390 395 400

Tyr Arg Ala Gly Asn Leu Trp Lys Ile Gln Tyr Gly Ala Asn Trp Arg
405 410 415

Asp Glu Thr Leu Thr Asp Arg Tyr Met Glu Leu Thr Arg Lys Leu Tyr
420 425 430

Gln Phe Met Thr Pro Phe Val Ser Lys Asn Pro Arg Gln Ser Phe Phe
435 440 445

Asn Tyr Arg Asp Val Asp Leu Gly Ile Asn Ser His Asn Gly Lys Ile
450 455 460

Ser Ser Tyr Val Glu Gly Lys Arg Tyr Gly Lys Lys Tyr Phe Ala Gly
465 470 475 480

Asn Phe Glu Arg Leu Val Lys Ile Lys Thr Arg Val Asp Ser Gly Asn
485 490 495

Phe Phe Arg Asn Glu Gln Ser Ile Pro Val Leu Pro
500 505



(2) INFORMATION FOR SEQ ID NO: 74: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1530 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: cDNA to mRNA 
(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Arabidopsis thaliana 

(B) STRAIN: Colombia 



(ix) FEATURE: 

(A) NAME/KEY: CDS

(B) LOCATION: 1..1527

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 74: 

TCC ATC CAA GAT CAA TTC ATA AAC TGT GTC AAA AGA AAC ACA CAT GTT 48
Ser Ile Gln Asp Gln Phe Ile Asn Cys Val Lys Arg Asn Thr His Val
1 5 10 15

TCT TTT CCA CTC GAG AAA ACG TTA TTC ACC CCT GCG AAA AAC GTC TCT 96
Ser Phe Pro Leu Glu Lys Thr Leu Phe Thr Pro Ala Lys Asn Val Ser
20 25 30

TTG TTC AAC CAA GTC CTT GAA TCG ACG GCT CAA AAT CTC CAG TTC TTG 144
Leu Phe Asn Gln Val Leu Glu Ser Thr Ala Gln Asn Leu Gln Phe Leu
35 40 45

GCA AAA TCC ATG CCT AAA CCG GGA TTC ATA TTC AGA CCG ATT CAC CAG 192
Ala Lys Ser Met Pro Lys Pro Gly Phe Ile Phe Arg Pro Ile His Gln
50 55 60

TCT CAA GTC CAA GCT TCC ATC ATT TGT TCA AAG AAA CTC GGA ATT CAT 240
Ser Gln Val Gln Ala Ser Ile Ile Cys Ser Lys Lys Leu Gly Ile His
65 70 75 80

TTT CGT GTT AGA AGT GGC GGT CAC GAT TTC GAG GCC TTG TCT TAT GTT 288
Phe Arg Val Arg Ser Gly Gly His Asp Phe Glu Ala Leu Ser Tyr Val
85 90 95

TCA CGG ATT GAA AAA CCG TTT ATA TTA CTC GAC CTG TCA AAA TTG AAA 336
Ser Arg Ile Glu Lys Pro Phe Ile Leu Leu Asp Leu Ser Lys Leu Lys
100 105 110

CAA ATC AAT GTT GAT ATT GAA TCC AAT AGT GCT TGG GTT CAA CCT GGT 384
Gln Ile Asn Val Asp Ile Glu Ser Asn Ser Ala Trp Val Gln Pro Gly
115 120 125

GCT ACG CTT GGT GAG CTT TAC TAC AGA ATT GCA GAG AAG AGC AAG ATC 432
Ala Thr Leu Gly Glu Leu Tyr Tyr Arg Ile Ala Glu Lys Ser Lys Ile
130 135 140

CAT GGA TTT CCC GCG GGT TTG TGC ACA AGT GTA GGC ATA GGT GGG TAT 480
His Gly Phe Pro Ala Gly Leu Cys Thr Ser Val Gly Ile Gly Gly Tyr
145 150 155 160

ATG ACA GGC GGT GGA TAC GGT ACC TTG ATG AGG AAG TAT GGT CTT GCG 528
Met Thr Gly Gly Gly Tyr Gly Thr Leu Met Arg Lys Tyr Gly Leu Ala
165 170 175

GGA GAT AAT GTT CTA GAC GTA AAG ATG GTT GAT GCA AAT GGT AAA TTA 576
Gly Asp Asn Val Leu Asp Val Lys Met Val Asp Ala Asn Gly Lys Leu
180 185 190

CTC GAC AGA GCC GCG ATG GGT GAG GAC CTA TTT TGG GCG ATT AGA GGA 624
Leu Asp Arg Ala Ala Met Gly Glu Asp Leu Phe Trp Ala Ile Arg Gly
195 200 205

GGC GGT GGA GCG AGT TTC GGG ATA GTT CTA GCA TGG AAG ATC AAG CTT 672
Gly Gly Gly Ala Ser Phe Gly Ile Val Leu Ala Trp Lys Ile Lys Leu
210 215 220



GTT CCT GTT CCT AAG ACT GTT ACC GTC TTC ACT GTC ACC AAA ACG TTA 720
Val Pro Val Pro Lys Thr Val Thr Val Phe Thr Val Thr Lys Thr Leu
225 230 235 240

GAA CAA GAC GCA AGA TTG AAG ACT ATT TCT AAG TGG CAA CAA ATT TCA 768
Glu Gln Asp Ala Arg Leu Lys Thr Ile Ser Lys Trp Gln Gln Ile Ser
245 250 255

TCC AAG ATT ATT GAA GAG ATA CAC ATC CGA GTG GTA CTC AGA GCA GCT 816
Ser Lys Ile Ile Glu Glu Ile His Ile Arg Val Val Leu Arg Ala Ala
260 265 270

GGA AAT GAT GGA AAC AAG ACT GTG ACA ATG ACC TAC CTA GGT CAG TTT 864
Gly Asn Asp Gly Asn Lys Thr Val Thr Met Thr Tyr Leu Gly Gln Phe
275 280 285

CTT GGC GAG AAA GGC ACC TTG CTG AAG GTT ATG GAG AAG GCT TTT CCA 912
Leu Gly Glu Lys Gly Thr Leu Leu Lys Val Met Glu Lys Ala Phe Pro
290 295 300

GAA CTA GGG TTA ACT CAA AAG GAT TGT ACT GAA ATG AGC TGG ATT GAA 960
Glu Leu Gly Leu Thr Gln Lys Asp Cys Thr Glu Met Ser Trp Ile Glu
305 310 315 320

GCC GCC CTT TTC CAT GGT GGA TTT CCA ACA GGT TCT CCT ATT GAA ATT 1008
Ala Ala Leu Phe His Gly Gly Phe Pro Thr Gly Ser Pro Ile Glu Ile
325 330 335

TTG CTT CAG CTC AAG TCG CCT CTA GGA AAA GAT TAC TTC AAA GCA ACG 1056
Leu Leu Gln Leu Lys Ser Pro Leu Gly Lys Asp Tyr Phe Lys Ala Thr
340 345 350

TCG GAT TTC GTT AAA GAA CCT ATT CCT GTG ATA GGC TTC AAA GGA ATA 1104
Ser Asp Phe Val Lys Glu Pro Ile Pro Val Ile Gly Phe Lys Gly Ile
355 360 365

TTC AAA AGA TTG ATT GAA GGA AAC ACA ACA TTT CTG AAC TGG ACT CCT 1152
Phe Lys Arg Leu Ile Glu Gly Asn Thr Thr Phe Leu Asn Trp Thr Pro
370 375 380

TAC GGT GGT ATG ATG TCG AAA ATC CCT GAA TCT GCG ATC CCA TTT CCG 1200
Tyr Gly Gly Met Met Ser Lys Ile Pro Glu Ser Ala Ile Pro Phe Pro
385 390 395 400

CAT AGA AAC GGA ACC CTC TTC AAG ATT CTC TAT TAC GCG AAC TGG CTA 1248
His Arg Asn Gly Thr Leu Phe Lys Ile Leu Tyr Tyr Ala Asn Trp Leu
405 410 415

GAG AAT GAC AAG ACA TCG AGT AGA AAA ATC AAC TGG ATC AAA GAG ATA 1296
Glu Asn Asp Lys Thr Ser Ser Arg Lys Ile Asn Trp Ile Lys Glu Ile
420 425 430

TAC AAT TAC ATG GCG CCT TAT GTC TCA AGC AAT CCA AGA CAA GCA TAT 1344
Tyr Asn Tyr Met Ala Pro Tyr Val Ser Ser Asn Pro Arg Gln Ala Tyr
435 440 445

GTG AAC TAC AGA GAT CTA GAC TTC GGA CAG AAC AAG AAC AAC GCA AAG 1392
Val Asn Tyr Arg Asp Leu Asp Phe Gly Gln Asn Lys Asn Asn Ala Lys
450 455 460



GTT AAC TTC ATT GAA GCT AAA ATC TGG GGA CCT AAG TAC TTC AAA GGC 1440
Val Asn Phe Ile Glu Ala Lys Ile Trp Gly Pro Lys Tyr Phe Lys Gly
465 470 475 480

AAT TTT GAC AGA TTG GTG AAG ATT AAA ACC AAG GTT GAT CCA GAG AAC 1488
Asn Phe Asp Arg Leu Val Lys Ile Lys Thr Lys Val Asp Pro Glu Asn
485 490 495

TTC TTC AGG CAC GAG CAG AGT ATC CCA CCT ATG CCC TAC TAG 1530
Phe Phe Arg His Glu Gln Ser Ile Pro Pro Met Pro Tyr
500 505



(2) INFORMATION FOR SEQ ID NO: 75: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 509 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 75: 

Ser Ile Gln Asp Gln Phe Ile Asn Cys Val Lys Arg Asn Thr His Val
1 5 10 15

Ser Phe Pro Leu Glu Lys Thr Leu Phe Thr Pro Ala Lys Asn Val Ser
20 25 30

Leu Phe Asn Gln Val Leu Glu Ser Thr Ala Gln Asn Leu Gln Phe Leu
35 40 45

Ala Lys Ser Met Pro Lys Pro Gly Phe Ile Phe Arg Pro Ile His Gln
50 55 60

Ser Gln Val Gln Ala Ser Ile Ile Cys Ser Lys Lys Leu Gly Ile His
65 70 75 80

Phe Arg Val Arg Ser Gly Gly His Asp Phe Glu Ala Leu Ser Tyr Val
85 90 95

Ser Arg Ile Glu Lys Pro Phe Ile Leu Leu Asp Leu Ser Lys Leu Lys
100 105 110

Gln Ile Asn Val Asp Ile Glu Ser Asn Ser Ala Trp Val Gln Pro Gly
115 120 125

Ala Thr Leu Gly Glu Leu Tyr Tyr Arg Ile Ala Glu Lys Ser Lys Ile
130 135 140

His Gly Phe Pro Ala Gly Leu Cys Thr Ser Val Gly Ile Gly Gly Tyr
145 150 155 160

Met Thr Gly Gly Gly Tyr Gly Thr Leu Met Arg Lys Tyr Gly Leu Ala
165 170 175

Gly Asp Asn Val Leu Asp Val Lys Met Val Asp Ala Asn Gly Lys Leu
180 185 190



Leu Asp Arg Ala Ala Met Gly Glu Asp Leu Phe Trp Ala Ile Arg Gly
195 200 205

Gly Gly Gly Ala Ser Phe Gly Ile Val Leu Ala Trp Lys Ile Lys Leu
210 215 220

Val Pro Val Pro Lys Thr Val Thr Val Phe Thr Val Thr Lys Thr Leu
225 230 235 240

Glu Gln Asp Ala Arg Leu Lys Thr Ile Ser Lys Trp Gln Gln Ile Ser
245 250 255

Ser Lys Ile Ile Glu Glu Ile His Ile Arg Val Val Leu Arg Ala Ala
260 265 270

Gly Asn Asp Gly Asn Lys Thr Val Thr Met Thr Tyr Leu Gly Gln Phe
275 280 285

Leu Gly Glu Lys Gly Thr Leu Leu Lys Val Met Glu Lys Ala Phe Pro
290 295 300

Glu Leu Gly Leu Thr Gln Lys Asp Cys Thr Glu Met Ser Trp Ile Glu
305 310 315 320

Ala Ala Leu Phe His Gly Gly Phe Pro Thr Gly Ser Pro Ile Glu Ile
325 330 335

Leu Leu Gln Leu Lys Ser Pro Leu Gly Lys Asp Tyr Phe Lys Ala Thr
340 345 350

Ser Asp Phe Val Lys Glu Pro Ile Pro Val Ile Gly Phe Lys Gly Ile
355 360 365

Phe Lys Arg Leu Ile Glu Gly Asn Thr Thr Phe Leu Asn Trp Thr Pro
370 375 380

Tyr Gly Gly Met Met Ser Lys Ile Pro Glu Ser Ala Ile Pro Phe Pro
385 390 395 400

His Arg Asn Gly Thr Leu Phe Lys Ile Leu Tyr Tyr Ala Asn Trp Leu
405 410 415

Glu Asn Asp Lys Thr Ser Ser Arg Lys Ile Asn Trp Ile Lys Glu Ile
420 425 430

Tyr Asn Tyr Met Ala Pro Tyr Val Ser Ser Asn Pro Arg Gln Ala Tyr
435 440 445

Val Asn Tyr Arg Asp Leu Asp Phe Gly Gln Asn Lys Asn Asn Ala Lys
450 455 460

Val Asn Phe Ile Glu Ala Lys Ile Trp Gly Pro Lys Tyr Phe Lys Gly
465 470 475 480

Asn Phe Asp Arg Leu Val Lys Ile Lys Thr Lys Val Asp Pro Glu Asn
485 490 495

Phe Phe Arg His Glu Gln Ser Ile Pro Pro Met Pro Tyr
500 505
SEQUENCE LISTING

(1) GENERAL INFORMATION: 

(i) APPLICANT: STUIVER, Maarten Hendrik
CUSTERS, Jerome Humbertina Henricus Victor
SELA-BURLAGE, Marianne Beatrix
MELCHERS, Leo Sjoerd
VAN DEVENTER-TROOST, Johanna Pieternella
LAGEWEG, Wessel
PONSTEIN, Anne Silene

(ii) TITLE OF INVENTION: ANTIFUNGAL PROTEINS, DNA CODING THEREFOR, AND HOSTS INCORPORATING SAME.

(iii) NUMBER OF SEQUENCES: 75

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: LADAS & PARRY

(B) STREET: 26 WEST 61 STREET 

(C) CITY: NEW YORK 

(D) STATE: NY 

(E) COUNTRY: USA 

(F) ZIP: 10023-7604

(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: 3.25" Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: WINDOWS 95 

(D) SOFTWARE: WORDPERFECT 8 

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: 09/258,031 

(B) FILING DATE: 25-FEB-1999 

(C) CLASSIFICATION: 435 

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: PCT/EP97/04923 

(B) FILING DATE: 04-SEP-1997 

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: EP97200831.2 

(B) FILING DATE: 19-MAR-1997 

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: EP96202466.7 

(B) FILING DATE: 04-SEP-1996 

(2) INFORMATION FOR SEQ ID NO: 1:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 25 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein 



(iii) HYPOTHETICAL: NO 



(iii) ANTI-SENSE: NO 

(v) FRAGMENT TYPE: internal 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helianthus annuus 

(B) STRAIN: cv. zebulon 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1:

Ser Ile Asn Val Asp Ile Glu Gln Glu Thr Ala Trp Val Gln Ala Gly
1 5 10 15

Ala Thr Leu Gly Glu Val Tyr Tyr Arg 
20 25 



(2) INFORMATION FOR SEQ ID NO: 2:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 25 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: NO 

(iii) ANTI-SENSE: NO

(v) FRAGMENT TYPE: internal 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helianthus annuus 

(B) STRAIN: cv. zebulon 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

Asp Pro Ser Phe Pro Ile Thr Gly Glu Val Tyr Thr Pro Gly Xaa Ser
1 5 10 15

Ser Phe Pro Thr Val Leu Gln Asn Tyr
20 25



(2) INFORMATION FOR SEQ ID NO : 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA

(iii) HYPOTHETICAL: YES

(ix) FEATURE:

(A) NAME/KEY: misc_feature



(B) LOCATION: 1 

(D) OTHER INFORMATION: /function= "primer" 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 3: 
AACTTCTCCN AGNGTNGCNC CNGCTTGNAC CCA 



(2) INFORMATION FOR SEQ ID NO : 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 32 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: YES

(ix) FEATURE:

(A) NAME/KEY: misc_feature

(B) LOCATION: 1 

(D) OTHER INFORMATION: /function= "primer" 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 
GATCCNTCTT TCCCNATTAC TGGNGAGGTT TA 
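
The degenerate primer of SEQ ID NO: 4 can be compared against the opening codons of SEQ ID NO: 5 with a short script. The following is only an illustrative sketch: the function name `primer_matches` and the variable names are hypothetical, the IUPAC table is the standard nucleotide ambiguity code, and the target string is the first 32 bases of SEQ ID NO: 5 as read from this listing with the codons joined.

```python
# Standard IUPAC nucleotide ambiguity codes mapped to the bases they allow.
IUPAC = {
    "A": "A", "C": "C", "G": "G", "T": "T",
    "R": "AG", "Y": "CT", "S": "CG", "W": "AT", "K": "GT", "M": "AC",
    "B": "CGT", "D": "AGT", "H": "ACT", "V": "ACG", "N": "ACGT",
}


def primer_matches(primer: str, target: str) -> bool:
    """Return True if every primer position permits the aligned target base.

    Spaces (the listing groups bases in blocks of ten) are ignored.
    The comparison starts at the first base of both sequences.
    """
    primer = primer.replace(" ", "").upper()
    target = target.replace(" ", "").upper()
    if len(primer) > len(target):
        return False
    return all(t in IUPAC[p] for p, t in zip(primer, target))


# SEQ ID NO: 4 exactly as printed in the listing.
SEQ_ID_4 = "GATCCNTCTT TCCCNATTAC TGGNGAGGTT TA"
# First 32 bases of SEQ ID NO: 5 (assumption: codons joined without spaces).
SEQ_ID_5_START = "GATCCGTCTTTCCCGATTACTGGGGAGGTTTA"

print(len(SEQ_ID_4.replace(" ", "")))
print(primer_matches(SEQ_ID_4, SEQ_ID_5_START))
```

Under these assumptions the primer length agrees with the stated 32 bases, and each N position lines up with a concrete base in SEQ ID NO: 5.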



(2) INFORMATION FOR SEQ ID NO : 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 354 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helianthus annuus 

(B) STRAIN: cv. zebulon 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1..354 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 

GAT CCG TCT TTC CCG ATT ACT GGG GAG GTT TAC ACT CCC GGA AAC TCA 48
Asp Pro Ser Phe Pro Ile Thr Gly Glu Val Tyr Thr Pro Gly Asn Ser
1 5 10 15

TCT TTT CCT ACC GTC TTG CAA AAC TAC ATC CGA AAC CTT CGG TTC AAT 96
Ser Phe Pro Thr Val Leu Gln Asn Tyr Ile Arg Asn Leu Arg Phe Asn
20 25 30

GAA ACT ACC ACA CCA AAA CCC TTT TTA ATC ATC ACA GCC GAA CAT GTT 144
Glu Thr Thr Thr Pro Lys Pro Phe Leu Ile Ile Thr Ala Glu His Val
35 40 45

TCC CAC ATT CAG GCA GCT GTG GTT TGT GGC AAA CAA AAC CGG TTG CTA 192
Ser His Ile Gln Ala Ala Val Val Cys Gly Lys Gln Asn Arg Leu Leu
50 55 60

CTG AAA ACC AGA AGC GGT GGT CAT GAT TAT GAA GGT CTT TCC TAC CTT 240
Leu Lys Thr Arg Ser Gly Gly His Asp Tyr Glu Gly Leu Ser Tyr Leu
65 70 75 80

ACA AAC ACA AAC CAA CCC TTC TTC ATT GTG GAC ATG TTC AAT TTA AGG 288
Thr Asn Thr Asn Gln Pro Phe Phe Ile Val Asp Met Phe Asn Leu Arg
85 90 95

TCC ATA AAC GTA GAT ATC GAA CAA GAA ACC GCA TGG GTC CAA GCC GGC 336
Ser Ile Asn Val Asp Ile Glu Gln Glu Thr Ala Trp Val Gln Ala Gly
100 105 110

GCC ACC CTC GGA GAA GTT 354
Ala Thr Leu Gly Glu Val
115

(2) INFORMATION FOR SEQ ID NO: 6:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 118 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6:

Asp Pro Ser Phe Pro Ile Thr Gly Glu Val Tyr Thr Pro Gly Asn Ser
1 5 10 15

Ser Phe Pro Thr Val Leu Gln Asn Tyr Ile Arg Asn Leu Arg Phe Asn
20 25 30

Glu Thr Thr Thr Pro Lys Pro Phe Leu Ile Ile Thr Ala Glu His Val
35 40 45

Ser His Ile Gln Ala Ala Val Val Cys Gly Lys Gln Asn Arg Leu Leu
50 55 60

Leu Lys Thr Arg Ser Gly Gly His Asp Tyr Glu Gly Leu Ser Tyr Leu
65 70 75 80

Thr Asn Thr Asn Gln Pro Phe Phe Ile Val Asp Met Phe Asn Leu Arg
85 90 95

Ser Ile Asn Val Asp Ile Glu Gln Glu Thr Ala Trp Val Gln Ala Gly
100 105 110

Ala Thr Leu Gly Glu Val
115



(2) INFORMATION FOR SEQ ID NO : 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO 

(ix) FEATURE: 

(A) NAME/KEY: misc_feature

(B) LOCATION: 1

(D) OTHER INFORMATION: /function= "primer"
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 
CAGGCAGCTG TGGTTTGTGG C 



(2) INFORMATION FOR SEQ ID NO : 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA
(iii) HYPOTHETICAL: NO
(iii) ANTI-SENSE: NO

(ix) FEATURE:

(A) NAME/KEY: misc_feature

(B) LOCATION: 1 

(D) OTHER INFORMATION: /function= "primer" 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 8: 
GTCCACAATG AAGAAGGGTT G 



(2) INFORMATION FOR SEQ ID NO : 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: cDNA

(iii) HYPOTHETICAL: NO
(iii) ANTI-SENSE: NO

(ix) FEATURE:

(A) NAME/KEY: misc_feature

(B) LOCATION: 1

(D) OTHER INFORMATION: /function= "primer"
(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 9: 
ACGTAGATAT CGAACAAGAA ACCGC 

(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 
GCTTTACTAC ACGGGCTTCC CCAG 

(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA

(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 
CTGGGGAAGC CCGTGTAGTA AAGC 
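
As listed, SEQ ID NO: 10 and SEQ ID NO: 11 appear to be reverse complements of one another. A minimal sketch for checking this is given below; the helper name `reverse_complement` is hypothetical and the check assumes both sequences are written 5' to 3' using only unambiguous bases.

```python
# Translation table for the four unambiguous DNA bases.
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA string, ignoring spaces."""
    seq = seq.replace(" ", "").upper()
    return seq.translate(COMPLEMENT)[::-1]


# Sequences exactly as printed in the listing (blocks of ten bases).
SEQ_ID_10 = "GCTTTACTAC ACGGGCTTCC CCAG"
SEQ_ID_11 = "CTGGGGAAGC CCGTGTAGTA AAGC"

print(reverse_complement(SEQ_ID_10) == SEQ_ID_11.replace(" ", ""))
```

The same helper could be applied to any other primer pair in the listing, provided the sequences contain no ambiguity codes such as N.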

(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 






(ii) MOLECULE TYPE: cDNA 



(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12:
GGTACTCCAA CCACGGCGCT C 



(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13:
CGGGAAGTTG CAGAAGATTG GGTTG 



(2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA
(iii) HYPOTHETICAL: NO
(iii) ANTI-SENSE: NO
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14:
GAGCAAGAGA AGAAGGAGAC 



(2) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1784 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA to mRNA 
(iii) HYPOTHETICAL: NO 



(iii) ANTI-SENSE: NO 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helianthus annuus 

(B) STRAIN: Zebulon 

(ix) FEATURE: 

(A) NAME/KEY: CDS

(B) LOCATION: 21..1608

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 

ATATCACATC TTCTTTCAAC ATG CAA ACT TCC ATT CTT ACT CTC CTT CTT 50
Met Gln Thr Ser Ile Leu Thr Leu Leu Leu
1 5 10

CTC TTG CTC TCA ACC CAA TCT TCT GCA ACT TCC CGT TCC ATT ACA GAT 98
Leu Leu Leu Ser Thr Gln Ser Ser Ala Thr Ser Arg Ser Ile Thr Asp
15 20 25

CGC TTC ATT CAA TGT TTA CAC GAC CGG GCC GAC CCT TCA TTT CCG ATA 146
Arg Phe Ile Gln Cys Leu His Asp Arg Ala Asp Pro Ser Phe Pro Ile
30 35 40

ACC GGA GAG GTT TAC ACT CCC GGA AAC TCA TCT TTT CCT ACC GTC TTG 194
Thr Gly Glu Val Tyr Thr Pro Gly Asn Ser Ser Phe Pro Thr Val Leu
45 50 55

CAA AAC TAC ATC CGA AAC CTT CGG TTC AAT GAA ACT ACC ACA CCA AAA 242
Gln Asn Tyr Ile Arg Asn Leu Arg Phe Asn Glu Thr Thr Thr Pro Lys
60 65 70

CCC TTT TTA ATC ATC ACA GCC GAA CAT GTT TCC CAC ATT CAG GCA GCT 290
Pro Phe Leu Ile Ile Thr Ala Glu His Val Ser His Ile Gln Ala Ala
75 80 85 90

GTG GTT TGT GGC AAA CAA AAC CGG TTG CTA CTG AAA ACC AGA AGC GGT 338
Val Val Cys Gly Lys Gln Asn Arg Leu Leu Leu Lys Thr Arg Ser Gly
95 100 105

GGT CAT GAT TAT GAA GGT CTT TCC TAC CTT ACA AAC ACA AAC CAA CCC 386
Gly His Asp Tyr Glu Gly Leu Ser Tyr Leu Thr Asn Thr Asn Gln Pro
110 115 120

TTC TTC ATT GTG GAC ATG TTC AAT TTA AGG TCC ATA AAC GTA GAT ATC 434
Phe Phe Ile Val Asp Met Phe Asn Leu Arg Ser Ile Asn Val Asp Ile
125 130 135

GAA CAA GAA ACC GCA TGG GTC CAA GCC GGT GCG ACT CTT GGT GAA GTG 482
Glu Gln Glu Thr Ala Trp Val Gln Ala Gly Ala Thr Leu Gly Glu Val
140 145 150

TAC TAT CGA ATA GCG GAG AAA AGT AAC AAG CAT GGT TTT CCG GCA GGG 530
Tyr Tyr Arg Ile Ala Glu Lys Ser Asn Lys His Gly Phe Pro Ala Gly
155 160 165 170

GTT TGT CCA ACG GTT GGC GTT GGT GGG CAT TTT AGT GGT GGT GGG TAT 578
Val Cys Pro Thr Val Gly Val Gly Gly His Phe Ser Gly Gly Gly Tyr
175 180 185

GGT AAT TTG ATG AGA AAA TAT GGT TTG TCG GTT GAT AAT ATT GTT GAT 626
Gly Asn Leu Met Arg Lys Tyr Gly Leu Ser Val Asp Asn Ile Val Asp
190 195 200



GCT CAA ATA ATA GAT GTG AAT GGC AAG CTT TTG GAT CGA AAG AGT ATG 674
Ala Gln Ile Ile Asp Val Asn Gly Lys Leu Leu Asp Arg Lys Ser Met
205 210 215

GGT GAG GAT TTG TTT TGG GCG ATC ACC GGC GGT GGT GGT GTT AGT TTT 722
Gly Glu Asp Leu Phe Trp Ala Ile Thr Gly Gly Gly Gly Val Ser Phe
220 225 230

GGT GTG GTT CTA GCC TAC AAA ATC AAA CTA GTT CGT GTT CCG GAG GTT 770
Gly Val Val Leu Ala Tyr Lys Ile Lys Leu Val Arg Val Pro Glu Val
235 240 245 250

GTG ACC GTG TTT ACC ATT GAA AGA AGA GAG GAA CAA AAC CTC AGC ACC 818
Val Thr Val Phe Thr Ile Glu Arg Arg Glu Glu Gln Asn Leu Ser Thr
255 260 265

ATC GCG GAA CGA TGG GTA CAA GTT GCT GAT AAG CTA GAT AGA GAT CTT 866
Ile Ala Glu Arg Trp Val Gln Val Ala Asp Lys Leu Asp Arg Asp Leu
270 275 280

TTC CTT CGA ATG ACC TTT AGT GTC ATA AAC GAT ACC AAC GGT GGA AAG 914
Phe Leu Arg Met Thr Phe Ser Val Ile Asn Asp Thr Asn Gly Gly Lys
285 290 295

ACA GTC CGT GCT ATC TTT CCA ACG TTG TAC CTT GGA AAC TCG AGG AAT 962
Thr Val Arg Ala Ile Phe Pro Thr Leu Tyr Leu Gly Asn Ser Arg Asn
300 305 310

CTT GTT ACA CTT TTG AAT AAA GAT TTC CCC GAG TTA GGG TTG CAA GAA 1010
Leu Val Thr Leu Leu Asn Lys Asp Phe Pro Glu Leu Gly Leu Gln Glu
315 320 325 330

TCG GAT TGT ACT GAA ATG AGT TGG GTT GAG TCT GTG CTT TAC TAC ACG 1058
Ser Asp Cys Thr Glu Met Ser Trp Val Glu Ser Val Leu Tyr Tyr Thr
335 340 345

GGC TTC CCC AGT GGT ACT CCA ACC ACG GCG CTC TTA AGC CGT ACT CCT 1106
Gly Phe Pro Ser Gly Thr Pro Thr Thr Ala Leu Leu Ser Arg Thr Pro
350 355 360

CAA AGA CTC AAC CCA TTC AAG ATC AAA TCC GAT TAT GTG CAA AAT CCT 1154
Gln Arg Leu Asn Pro Phe Lys Ile Lys Ser Asp Tyr Val Gln Asn Pro
365 370 375

ATT TCT AAA CGA CAG TTC GAG TTC ATC TTC GAA AGG CTG AAA GAA CTT 1202
Ile Ser Lys Arg Gln Phe Glu Phe Ile Phe Glu Arg Leu Lys Glu Leu
380 385 390

GAA AAC CAA ATG TTG GCT TTC AAC CCA TAT GGT GGT AGA ATG AGT GAA 1250
Glu Asn Gln Met Leu Ala Phe Asn Pro Tyr Gly Gly Arg Met Ser Glu
395 400 405 410

ATA TCC GAA TTC GCA AAG CCT TTC CCA CAT AGA TCG GGT AAC ATA GCG 1298
Ile Ser Glu Phe Ala Lys Pro Phe Pro His Arg Ser Gly Asn Ile Ala
415 420 425

AAA ATT CAA TAC GAA GTA AAC TGG GAG GAT CTT AGC GAT GAA GCC GAA 1346
Lys Ile Gln Tyr Glu Val Asn Trp Glu Asp Leu Ser Asp Glu Ala Glu
430 435 440



AAT CGT TAC TTG AAT TTC ACA AGG CTG ATG TAT GAT TAC ATG ACC CCA 1394
Asn Arg Tyr Leu Asn Phe Thr Arg Leu Met Tyr Asp Tyr Met Thr Pro
445 450 455

TTT GTG TCG AAA AAC CCT AGA AAA GCA TTT TTG AAC TAT AGG GAT TTG 1442
Phe Val Ser Lys Asn Pro Arg Lys Ala Phe Leu Asn Tyr Arg Asp Leu
460 465 470

GAT ATT GGT ATC AAC AGC CAT GGC AGG AAT GCT TAT ACT GAA GGA ATG 1490
Asp Ile Gly Ile Asn Ser His Gly Arg Asn Ala Tyr Thr Glu Gly Met
475 480 485 490

GTT TAT GGG CAC AAG TAT TTC AAA GAG ACA AAT TAC AAG AGG CTA GTA 1538
Val Tyr Gly His Lys Tyr Phe Lys Glu Thr Asn Tyr Lys Arg Leu Val
495 500 505

AGT GTG AAG ACT AAA GTT GAT CCT GAC AAC TTC TTT AGG AAT GAG CAA 1586
Ser Val Lys Thr Lys Val Asp Pro Asp Asn Phe Phe Arg Asn Glu Gln
510 515 520

AGC ATC CCA ACT TTG TCA TCT T GAAGAACGTA CATATATAAA TAAATACCTT 1638
Ser Ile Pro Thr Leu Ser Ser
525

TGTGCATGGT ATTTTCAGGG TGTTAAAGTG ATATTCAGAT ATTTATGATA GAATTTTGAC 1698
TTGTATTTTA TACAATCAAA ATTGTATGGT TCTCCGAATT TCTCTTTTTA ATTCTGAAAA 1758
ATACATATTA GTATTGTCAA AAAAAA 1784

(2) INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 529 amino acids

(B) TYPE: amino acid 
(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 

Met Gln Thr Ser Ile Leu Thr Leu Leu Leu Leu Leu Leu Ser Thr Gln
1 5 10 15

Ser Ser Ala Thr Ser Arg Ser Ile Thr Asp Arg Phe Ile Gln Cys Leu
20 25 30

His Asp Arg Ala Asp Pro Ser Phe Pro Ile Thr Gly Glu Val Tyr Thr
35 40 45

Pro Gly Asn Ser Ser Phe Pro Thr Val Leu Gln Asn Tyr Ile Arg Asn
50 55 60

Leu Arg Phe Asn Glu Thr Thr Thr Pro Lys Pro Phe Leu Ile Ile Thr
65 70 75 80

Ala Glu His Val Ser His Ile Gln Ala Ala Val Val Cys Gly Lys Gln
85 90 95

Asn Arg Leu Leu Leu Lys Thr Arg Ser Gly Gly His Asp Tyr Glu Gly
100 105 110

Leu Ser Tyr Leu Thr Asn Thr Asn Gln Pro Phe Phe Ile Val Asp Met
115 120 125

Phe Asn Leu Arg Ser Ile Asn Val Asp Ile Glu Gln Glu Thr Ala Trp
130 135 140

Val Gln Ala Gly Ala Thr Leu Gly Glu Val Tyr Tyr Arg Ile Ala Glu
145 150 155 160

Lys Ser Asn Lys His Gly Phe Pro Ala Gly Val Cys Pro Thr Val Gly
165 170 175

Val Gly Gly His Phe Ser Gly Gly Gly Tyr Gly Asn Leu Met Arg Lys
180 185 190

Tyr Gly Leu Ser Val Asp Asn Ile Val Asp Ala Gln Ile Ile Asp Val
195 200 205

Asn Gly Lys Leu Leu Asp Arg Lys Ser Met Gly Glu Asp Leu Phe Trp
210 215 220

Ala Ile Thr Gly Gly Gly Gly Val Ser Phe Gly Val Val Leu Ala Tyr
225 230 235 240

Lys Ile Lys Leu Val Arg Val Pro Glu Val Val Thr Val Phe Thr Ile
245 250 255

Glu Arg Arg Glu Glu Gln Asn Leu Ser Thr Ile Ala Glu Arg Trp Val
260 265 270

Gln Val Ala Asp Lys Leu Asp Arg Asp Leu Phe Leu Arg Met Thr Phe
275 280 285

Ser Val Ile Asn Asp Thr Asn Gly Gly Lys Thr Val Arg Ala Ile Phe
290 295 300

Pro Thr Leu Tyr Leu Gly Asn Ser Arg Asn Leu Val Thr Leu Leu Asn
305 310 315 320

Lys Asp Phe Pro Glu Leu Gly Leu Gln Glu Ser Asp Cys Thr Glu Met
325 330 335

Ser Trp Val Glu Ser Val Leu Tyr Tyr Thr Gly Phe Pro Ser Gly Thr
340 345 350

Pro Thr Thr Ala Leu Leu Ser Arg Thr Pro Gln Arg Leu Asn Pro Phe
355 360 365

Lys Ile Lys Ser Asp Tyr Val Gln Asn Pro Ile Ser Lys Arg Gln Phe
370 375 380

Glu Phe Ile Phe Glu Arg Leu Lys Glu Leu Glu Asn Gln Met Leu Ala
385 390 395 400

Phe Asn Pro Tyr Gly Gly Arg Met Ser Glu Ile Ser Glu Phe Ala Lys
405 410 415



Pro Phe Pro His Arg Ser Gly Asn Ile Ala Lys Ile Gln Tyr Glu Val
420 425 430

Asn Trp Glu Asp Leu Ser Asp Glu Ala Glu Asn Arg Tyr Leu Asn Phe
435 440 445

Thr Arg Leu Met Tyr Asp Tyr Met Thr Pro Phe Val Ser Lys Asn Pro
450 455 460

Arg Lys Ala Phe Leu Asn Tyr Arg Asp Leu Asp Ile Gly Ile Asn Ser
465 470 475 480

His Gly Arg Asn Ala Tyr Thr Glu Gly Met Val Tyr Gly His Lys Tyr
485 490 495

Phe Lys Glu Thr Asn Tyr Lys Arg Leu Val Ser Val Lys Thr Lys Val
500 505 510

Asp Pro Asp Asn Phe Phe Arg Asn Glu Gln Ser Ile Pro Thr Leu Ser
515 520 525

Ser



(2) INFORMATION FOR SEQ ID NO: 17: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(iii) HYPOTHETICAL: NO 



(iii) ANTI-SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 
CCGCCATGGA GACTTCCATT CTTACTC 



(2) INFORMATION FOR SEQ ID NO : 18: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 
GCCGGATCCT CAAGATGACA AAGTTGGGAT GCT 






(2) INFORMATION FOR SEQ ID NO: 19: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1589 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: cDNA to mRNA 



(iii) HYPOTHETICAL: NO
(iii) ANTI-SENSE: NO



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helianthus annuus 

(B) STRAIN: Zebulon 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1..1590 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 



ATG GAG ACT TCC ATT CTT ACT CTC CTT CTT CTC TTG CTC TCA ACC CAA 48
Met Glu Thr Ser Ile Leu Thr Leu Leu Leu Leu Leu Leu Ser Thr Gln
1 5 10 15

TCT TCT GCA ACT TCC CGT TCC ATT ACA GAT CGC TTC ATT CAA TGT TTA 96
Ser Ser Ala Thr Ser Arg Ser Ile Thr Asp Arg Phe Ile Gln Cys Leu
20 25 30

CAC GAC CGG GCC GAC CCT TCA TTT CCG ATA ACC GGA GAG GTT TAC ACT 144
His Asp Arg Ala Asp Pro Ser Phe Pro Ile Thr Gly Glu Val Tyr Thr
35 40 45

CCC GGA AAC TCA TCT TTT CCT ACC GTC TTG CAA AAC TAC ATC CGA AAC 192
Pro Gly Asn Ser Ser Phe Pro Thr Val Leu Gln Asn Tyr Ile Arg Asn
50 55 60

CTT CGG TTC AAT GAA ACT ACC ACA CCA AAA CCC TTT TTA ATC ATC ACA 240
Leu Arg Phe Asn Glu Thr Thr Thr Pro Lys Pro Phe Leu Ile Ile Thr
65 70 75 80

GCC GAA CAT GTT TCC CAC ATT CAG GCA GCT GTG GTT TGT GGC AAA CAA 288
Ala Glu His Val Ser His Ile Gln Ala Ala Val Val Cys Gly Lys Gln
85 90 95

AAC CGG TTG CTA CTG AAA ACC AGA AGC GGT GGT CAT GAT TAT GAA GGT 336
Asn Arg Leu Leu Leu Lys Thr Arg Ser Gly Gly His Asp Tyr Glu Gly
100 105 110

CTT TCC TAC CTT ACA AAC ACA AAC CAA CCC TTC TTC ATT GTG GAC ATG 384
Leu Ser Tyr Leu Thr Asn Thr Asn Gln Pro Phe Phe Ile Val Asp Met
115 120 125

TTC AAT TTA AGG TCC ATA AAC GTA GAT ATC GAA CAA GAA ACC GCA TGG 432
Phe Asn Leu Arg Ser Ile Asn Val Asp Ile Glu Gln Glu Thr Ala Trp
130 135 140



GTC CAA GCC GGT GCG ACT CTT GGT GAA GTG TAC TAT CGA ATA GCG GAG 480
Val Gln Ala Gly Ala Thr Leu Gly Glu Val Tyr Tyr Arg Ile Ala Glu
145 150 155 160

AAA AGT AAC AAG CAT GGT TTT CCG GCA GGG GTT TGT CCA ACG GTT GGC 528
Lys Ser Asn Lys His Gly Phe Pro Ala Gly Val Cys Pro Thr Val Gly
165 170 175

GTT GGT GGG CAT TTT AGT GGT GGT GGG TAT GGT AAT TTG ATG AGA AAA 576
Val Gly Gly His Phe Ser Gly Gly Gly Tyr Gly Asn Leu Met Arg Lys
180 185 190

TAT GGT TTG TCG GTT GAT AAT ATT GTT GAT GCT CAA ATA ATA GAT GTG 624
Tyr Gly Leu Ser Val Asp Asn Ile Val Asp Ala Gln Ile Ile Asp Val
195 200 205

AAT GGC AAG CTT TTG GAT CGA AAG AGT ATG GGT GAG GAT TTG TTT TGG 672
Asn Gly Lys Leu Leu Asp Arg Lys Ser Met Gly Glu Asp Leu Phe Trp
210 215 220



GCG ATC ACC GGC GGT GGT GGT GTT AGT TTT GGT GTG GTT CTA GCC TAC 720
Ala Ile Thr Gly Gly Gly Gly Val Ser Phe Gly Val Val Leu Ala Tyr
225 230 235 240

AAA ATC AAA CTA GTT CGT GTT CCG GAG GTT GTG ACC GTG TTT ACC ATT 768
Lys Ile Lys Leu Val Arg Val Pro Glu Val Val Thr Val Phe Thr Ile
245 250 255

GAA AGA AGA GAG GAA CAA AAC CTC AGC ACC ATC GCG GAA CGA TGG GTA 816
Glu Arg Arg Glu Glu Gln Asn Leu Ser Thr Ile Ala Glu Arg Trp Val
260 265 270

CAA GTT GCT GAT AAG CTA GAT AGA GAT CTT TTC CTT CGA ATG ACC TTT 864
Gln Val Ala Asp Lys Leu Asp Arg Asp Leu Phe Leu Arg Met Thr Phe
275 280 285

AGT GTC ATA AAC GAT ACC AAC GGT GGA AAG ACA GTC CGT GCT ATC TTT 912
Ser Val Ile Asn Asp Thr Asn Gly Gly Lys Thr Val Arg Ala Ile Phe
290 295 300

CCA ACG TTG TAC CTT GGA AAC TCG AGG AAT CTT GTT ACA CTT TTG AAT 960
Pro Thr Leu Tyr Leu Gly Asn Ser Arg Asn Leu Val Thr Leu Leu Asn
305 310 315 320

AAA GAT TTC CCC GAG TTA GGG TTG CAA GAA TCG GAT TGT ACT GAA ATG 1008
Lys Asp Phe Pro Glu Leu Gly Leu Gln Glu Ser Asp Cys Thr Glu Met
325 330 335

AGT TGG GTT GAG TCT GTG CTT TAC TAC ACG GGC TTC CCC AGT GGT ACT 1056
Ser Trp Val Glu Ser Val Leu Tyr Tyr Thr Gly Phe Pro Ser Gly Thr
340 345 350

CCA ACC ACG GCG CTC TTA AGC CGT ACT CCT CAA AGA CTC AAC CCA TTC 1104
Pro Thr Thr Ala Leu Leu Ser Arg Thr Pro Gln Arg Leu Asn Pro Phe
355 360 365

AAG ATC AAA TCC GAT TAT GTG CAA AAT CCT ATT TCT AAA CGA CAG TTC 1152
Lys Ile Lys Ser Asp Tyr Val Gln Asn Pro Ile Ser Lys Arg Gln Phe
370 375 380



GAG TTC ATC TTC GAA AGG ATG AAA GAA CTT GAA AAC CAA ATG TTG GCG 1200
Glu Phe Ile Phe Glu Arg Met Lys Glu Leu Glu Asn Gln Met Leu Ala
385 390 395 400

TTC AAC CCA TAT GGT GGT AGA ATG AGT GAA ATA TCC GAA TTC GCA AAG 1248
Phe Asn Pro Tyr Gly Gly Arg Met Ser Glu Ile Ser Glu Phe Ala Lys
405 410 415

CCT TTC CCA CAT AGA TCG GGT AAC ATA GCG AAG ATT CAA TAC GAA GTA 1296
Pro Phe Pro His Arg Ser Gly Asn Ile Ala Lys Ile Gln Tyr Glu Val
420 425 430



AAC TGG GAG GAT CTT AGC GAT GAA GCC GAA AAT CGT TAC TTG AAT TTC 1344
Asn Trp Glu Asp Leu Ser Asp Glu Ala Glu Asn Arg Tyr Leu Asn Phe
435 440 445

ACA AGG CTG ATG TAT GAT TAC ATG ACT CCA TTT GTG TCG AAA AAC CCT 1392
Thr Arg Leu Met Tyr Asp Tyr Met Thr Pro Phe Val Ser Lys Asn Pro
450 455 460

AGA GAA GCA TTT TTG AAC TAT AGG GAT TTG GAT ATT GGT ATC AAC AGC 1440
Arg Glu Ala Phe Leu Asn Tyr Arg Asp Leu Asp Ile Gly Ile Asn Ser
465 470 475 480

CAT GGC AGG AAT GCT TAT ACT GAA GGA ATG GTT TAT GGG CAC AAA TAT 1488
His Gly Arg Asn Ala Tyr Thr Glu Gly Met Val Tyr Gly His Lys Tyr
485 490 495

TTC AAA GAG ACA AAT TAC AAG AGG CTA GTA AGT GTG AAG ACT AAA GTT 1536
Phe Lys Glu Thr Asn Tyr Lys Arg Leu Val Ser Val Lys Thr Lys Val
500 505 510

GAT CCT GAC AAC TTC TTT AGG AAT GAG CAA AGC ATC CCA ACT TTG TCA 1584
Asp Pro Asp Asn Phe Phe Arg Asn Glu Gln Ser Ile Pro Thr Leu Ser
515 520 525

TCT TG 1589
Ser
530

(2) INFORMATION FOR SEQ ID NO : 20: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 529 amino acids

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20: 

Met Glu Thr Ser Ile Leu Thr Leu Leu Leu Leu Leu Leu Ser Thr Gln
1 5 10 15

Ser Ser Ala Thr Ser Arg Ser Ile Thr Asp Arg Phe Ile Gln Cys Leu
20 25 30

His Asp Arg Ala Asp Pro Ser Phe Pro Ile Thr Gly Glu Val Tyr Thr
35 40 45

Pro Gly Asn Ser Ser Phe Pro Thr Val Leu Gln Asn Tyr Ile Arg Asn
50 55 60

Leu Arg Phe Asn Glu Thr Thr Thr Pro Lys Pro Phe Leu Ile Ile Thr
65 70 75 80

Ala Glu His Val Ser His Ile Gln Ala Ala Val Val Cys Gly Lys Gln
85 90 95

Asn Arg Leu Leu Leu Lys Thr Arg Ser Gly Gly His Asp Tyr Glu Gly
100 105 110

Leu Ser Tyr Leu Thr Asn Thr Asn Gln Pro Phe Phe Ile Val Asp Met
115 120 125

Phe Asn Leu Arg Ser Ile Asn Val Asp Ile Glu Gln Glu Thr Ala Trp
130 135 140

Val Gln Ala Gly Ala Thr Leu Gly Glu Val Tyr Tyr Arg Ile Ala Glu
145 150 155 160

Lys Ser Asn Lys His Gly Phe Pro Ala Gly Val Cys Pro Thr Val Gly
165 170 175

Val Gly Gly His Phe Ser Gly Gly Gly Tyr Gly Asn Leu Met Arg Lys
180 185 190

Tyr Gly Leu Ser Val Asp Asn Ile Val Asp Ala Gln Ile Ile Asp Val
195 200 205

Asn Gly Lys Leu Leu Asp Arg Lys Ser Met Gly Glu Asp Leu Phe Trp
210 215 220

Ala Ile Thr Gly Gly Gly Gly Val Ser Phe Gly Val Val Leu Ala Tyr
225 230 235 240

Lys Ile Lys Leu Val Arg Val Pro Glu Val Val Thr Val Phe Thr Ile
245 250 255

Glu Arg Arg Glu Glu Gln Asn Leu Ser Thr Ile Ala Glu Arg Trp Val
260 265 270

Gln Val Ala Asp Lys Leu Asp Arg Asp Leu Phe Leu Arg Met Thr Phe
275 280 285

Ser Val Ile Asn Asp Thr Asn Gly Gly Lys Thr Val Arg Ala Ile Phe
290 295 300

Pro Thr Leu Tyr Leu Gly Asn Ser Arg Asn Leu Val Thr Leu Leu Asn
305 310 315 320

Lys Asp Phe Pro Glu Leu Gly Leu Gln Glu Ser Asp Cys Thr Glu Met
325 330 335

Ser Trp Val Glu Ser Val Leu Tyr Tyr Thr Gly Phe Pro Ser Gly Thr
340 345 350

Pro Thr Thr Ala Leu Leu Ser Arg Thr Pro Gln Arg Leu Asn Pro Phe
355 360 365



Lys Ile Lys Ser Asp Tyr Val Gln Asn Pro Ile Ser Lys Arg Gln Phe
370 375 380

Glu Phe Ile Phe Glu Arg Met Lys Glu Leu Glu Asn Gln Met Leu Ala
385 390 395 400

Phe Asn Pro Tyr Gly Gly Arg Met Ser Glu Ile Ser Glu Phe Ala Lys
405 410 415

Pro Phe Pro His Arg Ser Gly Asn Ile Ala Lys Ile Gln Tyr Glu Val
420 425 430

Asn Trp Glu Asp Leu Ser Asp Glu Ala Glu Asn Arg Tyr Leu Asn Phe
435 440 445

Thr Arg Leu Met Tyr Asp Tyr Met Thr Pro Phe Val Ser Lys Asn Pro
450 455 460

Arg Glu Ala Phe Leu Asn Tyr Arg Asp Leu Asp Ile Gly Ile Asn Ser
465 470 475 480

His Gly Arg Asn Ala Tyr Thr Glu Gly Met Val Tyr Gly His Lys Tyr
485 490 495

Phe Lys Glu Thr Asn Tyr Lys Arg Leu Val Ser Val Lys Thr Lys Val
500 505 510

Asp Pro Asp Asn Phe Phe Arg Asn Glu Gln Ser Ile Pro Thr Leu Ser
515 520 525

Ser



(2) INFORMATION FOR SEQ ID NO: 21: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 350 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA to mRNA 
(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Arabidopsis thaliana 

(B) STRAIN: ecotype Columbia 
(ix) FEATURE: 

(A) NAME/KEY: CDS

(B) LOCATION: 2..350

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21:
GAGAAACTCG GAGACTTTCA CACAATGCCT AACCTCAAAC TCCGACCCCA AACATCCCAT 60
CTCCCCCGCT ATCTTCTTCT CCGGAAATGG CTCCTACTCC TCCGTATTAC AAGCCAACAT 120
CCGTAACCTC CGCTTCAACA CCACCTCAAC TCCGAAACCC TTCCTCATAA TCGCCGCAAC 180
ACATGAATCC CATGTGCAAG CCGCGATTAC TTGCGGGAAA CGCCACAACC TTCAGATGAA 240
AATCAGAAGT GGAGGCCACG ACTACGATGG CTTGTCATAC GTTACATACT CTGGCAAACC 300
GTTCTTCGTC CTCGACATGT TTAACCTCCG TTCGGTGGAT GTCGACGTGG 350



(2) INFORMATION FOR SEQ ID NO: 22: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 278 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA to mRNA 
(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Arabidopsis thaliana 

(B) STRAIN: ecotype Columbia 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 2..278

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22:
GGCATGGATC TCCGCCGGAG CGACTCTCGG AGAGGTTTAT TATCGGATTT GGGAGAAAAG 60
CAGAGTCCAT GGATTCCCCG CCGGAGTTTG ACCGACGGTT GGTGTTGGTG GGCATTTAAG 120
CGGCGGTGGT TACGGTAACA TGGTGAGGAA GTTTGGATTA TCTGTGGATT ACGTTGAGGA 180
TGCCAAGATC GTCGATGTAA ACNGTCGGGT TTTAGATCGG AAAGCAATGG GTGAGGATCT 240
GTTCTGGGCG ATTACCGGTG GAGGAGGAGG TAGCGTAC 278



(2) INFORMATION FOR SEQ ID NO: 23: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 345 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Arabidopsis thaliana 

(B) STRAIN: ecotype Columbia 



(ix) FEATURE: 



(A) NAME/KEY: CDS

(B) LOCATION: 2..345

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 23:
TGGACATATT AGCGGAGGAG GATTCGGTAC AATAATGAGG AAATACGGTT TAGCGTCTGA 60
TAACGTTGTG GACGCACGTT TGATGGATGT AAATGGGAAA ACTCTTGACC GGAAAACGAT 120
GGGAGAGGAT TTGTTTTGGG CGCTTAGAGG CGGTGGAGCT GCGAGTTTTG GCGTTGTCTT 180
GTCGTGGAAG GTTAAGCTTG CTAGGGTTCC TGAAAAGGTA ACTTGTTTCA TAAGTCAACA 240
TCCGATGGGA CCTAGCATGA ACAAGCTTGT TCATAGATGG CAATCCATAG GATCAAGANN 300
GCTAGACGAA GATTTATTCA TCAGAGTCAA TATTGACAAC AGTCT 345

(2) INFORMATION FOR SEQ ID NO: 24: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 695 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA to mRNA 
(iii) HYPOTHETICAL: NO
(iii) ANTI-SENSE: NO

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Arabidopsis thaliana 

(B) STRAIN: ecotype Columbia 

(ix) FEATURE: 

(A) NAME/KEY: CDS

(B) LOCATION: 1..695 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24: 
GTTCGTTAAA ACCTATCCTN NANGGGCNAA AGNATATCAA AGNTTGNTTA NGNAACCCAA 60
NATTTCTGAA CTGGCCNCCT TCGGTGGTAT ATGNCNAAAN CCCTTGAATC TGCGNANCCN 120
ATTCCGCATA GAAACGGAAC CCTCTTCAAG ATTCTCTATT TACNCGAACT GNCTAGANNG 180
AATGACAAGA CATCGAGTAG NAAAATCAAC TGGATCAAAG AGATATACAA TTACATGGCG 240
CCTTATGTCT CAAGCAATCC AAGACAAGCA TATGTGAACT ACAGAGATCT AGACTTCGGA 300
CAGAACAAGA ACAACGCAAA GGTTAACTTC ATTGAAGCTA AAATCTGGGG ACCTAAGTAC 360
TTCAAAGGCA ATTTTGACAG ATTGGTGAAG ATTAAAACCA AGGTTGATCC AGAGAACTTC 420
TTCAGGCACG AGCAGAGTAT CCCACCTATG CCCTACTAGA AGCTAGGTTC ATGAAACCAA 480
TAACATTATC AAAAATAAGR ATAAATGRTA ATTGTATACA ACATGATTCG KCTTTCTTTA 540
TTTCAGACAA TGTGGACACT ACTCTAAANT AAAAWGTCNA TTTACCTTAA AAAAAAAATA 600
ATCCCCNNTA ANANAAAANT GGGGGGGCCN TTTTTGGGGN TCCCGGTTTT NGGACGGGGN 660
GCTTTNGGGG GGCTTGGNNT TTTTTTNGGN GCCCC 695

(2) INFORMATION FOR SEQ ID NO : 25: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 495 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA to mRNA 
(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Arabidopsis thaliana 

(B) STRAIN: ecotype Columbia 

(ix) FEATURE: 

(A) NAME/KEY: CDS

(B) LOCATION: 2..495

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 25:
TCTGTTTTNA GGCAGAGCAG AGGAAGTTGT TGCTTTGCTT GGTAAGGAGT TTCCTGAATT 60
NAGTTTAAAG AAGGAGAACT GTTCGGAGAT GACTTGGTTT CAGTCAGCTT TATGGTGGGA 120
TAATCGTGTT AACCCTACTC ANATTGATCC WAAAGTGTTT CTCGATCGGA ATCTTGATAG 180
AGCGAATTTC GGAAAGAGGA AATCGGATTA CGTTGCGAGT AAGATTCCTA GAGATGGGAT 240
TAAGYCTTTT TCCAAGARGA TGMCTGACCT GGGGAAAAYC GGGCTTGTTT TTAAWCCGTA 300
TGGTGGGAAA ATGGCGGAGG TTACGGTTAA CGCGACGCCG TTTCCNCACC GAAGCAAGCT 360
TTTTAAGATT CAGTACTCGG TGACTTNGCA AGAAAACTCT NTCGAGATAG AGAAAGGGTT 420
TCTTGAATCA GGCTAACGTC CTTATAGGTT CATGACCGGG TTTTTNAGCA AGANCCCTGG 480
AATNCTTACT TNAAT 495

(2) INFORMATION FOR SEQ ID NO: 26: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 204 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA to mRNA 
(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Arabidopsis thaliana 

(B) STRAIN: ecotype Columbia 

(ix) FEATURE:

(A) NAME/KEY: CDS

(B) LOCATION: 1..204

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 26:
AAATTAAAAC AAATCAATGT TGATATTGAA TCCAATAGTG CTTGGTTTCA ACCTGGTGCT 60
ACGCTTGGTG AGCTTTACTA CAGAATTNCA GAGAAGAGCA AAATCCATGG ATTTCCNGCG 120
GGTTTNTNCA CAAGCNTAGG CATAGGTGGG TATATNANAG GCGGTGGATA CGGTACCTTG 180
ATGAGGAAGT ATGGTCTTNC GGGA 204

(2) INFORMATION FOR SEQ ID NO: 27: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 491 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA to mRNA 
(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Arabidopsis thaliana 

(B) STRAIN: ecotype Columbia 

(ix) FEATURE: 

(A) NAME/KEY: CDS

(B) LOCATION: 2..491

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 27:
GAGATTTCTC GAGCAAGATA CTCCACTGAT GATCTTTGAG CCATTGGGTG GGAAAATCAG 60
CAAGATTTCA GAAACAGAAT CTCCATATCC ACACAGAAGA GGTAATCTGT ATAATATACA 120
GTACATGGTG AAATGGAAAG TGAATGANGT CGAGGAGATG AACAAACATG TCAGGTGGAT 180
GAGATCGTTA CACGATTACA TGACTCCGTA TGTTTCTAAA TCGCCGAGAG GAGCTTATTT 240
GANTTACAGA GATCTTGATT TGGGCTCGAC CAAAGGGATT AACACGGGTT TCGGAGATGC 300
AAGGAAATGG NNGGGTGAGN CTTTTTTCAA AGGTAATTTC CAAGGGGTTA GGTTTTGGTT 360
AAAGGGGAGG TTTNNCCCAN CAAATTTTTT TTCAGGANCC GGCCANGNTT TTCCCCCCCC 420
TNTTTTTNGG NCCCCAATCN AAANCCCCGT TTTAAAAGGG GGGCCATTTC NTTTTTTNCA 480
NNTTAAAAGG G 491



(2) INFORMATION FOR SEQ ID NO : 28: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 407 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA to mRNA 
(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Arabidopsis thaliana 

(B) STRAIN: ecotype Columbia 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 3..407

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 28:
ATTTGTTCGT GAGGTTAACT TTGACTTTAG TCAACGGTAC GAAGCCTGGT GAGAATACGG 60
TTTTAGCGAC TTTCATTGGG ATGTATTTAG GCCGGTCGGA TAAGCTGTTG ACCGTNATGA 120
ACCGGGATTT CCCGGAGTTG AAGCTGAAGA AAACCGATTN TACCGAGATG AGATGGATCG 180
ATTCGGTTCT GTTTTGGGAC GATTATCCGG TTGGTACACC GACTTCTGTG CTACTAAATC 240
CGCTAGTCGC AAAAAAGTTG TTCATGAAAC GAAAATCGGA CTACGTGAAG CGTCTNATTT 300
TCGAGAAQCC GATCTCNNGT TTGATACTCA AGAAATTTGT AGAGGTTNNG AAAGTTAAAA 360
TNAATTTGGA TCCGCATTNN GGNANNNATG GTGAAACCCC NNGTTNT 407



(2) INFORMATION FOR SEQ ID NO : 29: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 360 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA to mRNA 
(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Arabidopsis thaliana 

(B) STRAIN: ecotype Columbia 

(ix) FEATURE:

(A) NAME/KEY: CDS

(B) LOCATION: 3..360

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 29:
ACGGCGTCGT ATTGGCCTAC AAAATAAACC TTGTTGAAGT CCCAGAAAAC GTCACCGTTT 60
TCAGAATCTC CCGGACGTTA GAACAAAATG CGACGGATAT CATTCACCGG TGGCAACAAG 120
TTGCACCGAA GCTTCCCGAC GAGCTTTTCA TAAGANCAGT CATTGACGTA NAAACGGCAC 180
TGTTTCATNN CTCAAAAGAC CGTCAGACAA CATTCATAGC AATGTTTCTA GGAGACACGN 240
CAACTCTACT GTCGATATTA AACCGGAGAT TCCCAGAATT GGGTTTGGTC CGGTCTGACT 300
GTACCGNAAC AAGCNNTTGG ATCCAATCTG TGCTATTTTT GGGACAAATA TCCCAGGTTG 360

(2) INFORMATION FOR SEQ ID NO: 30: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 427 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA to mRNA 
(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Arabidopsis thaliana 

(B) STRAIN: ecotype Columbia

(ix) FEATURE:

(A) NAME/KEY: CDS

(B) LOCATION: 3..427

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 30: 
TCTTCACTGT CACCAAAACG TTAGAACAAG ACGCAAGATT GAAGACTATT TCTAAGTGGC 60
AACAAATTTC ATCCAAGATT ATTGAAGAGA TACACATCCG AGTGGTACTC AGAGCAGCTG 120
GAAATGATGG AAACAAGACT GTGACAATGA CCTACCTAGG TCAGTTTCTT GGCGAGAAAG 180
GCACCTTGCT GAAGGTTATG GAGAAGGCTT TTCCAGAACT AGGGTTAACT CAAAAGGATT 240
GTACTGAAAT GAGCTGGATT GAAGCCGCCC TTTTCCATGG TGGRTTTCCA ACAGGKTCTC 300
CTATTGAAAT TTTGCTTMAG CTCAAGTCGC CTYTAGGAAA AGRTTWCTTC AAAGCAACGK 360
CGGATTTCGT TAAAGAACCT WTTCCTGTGA TAGGGCTCAA AGGAATATTC AAAAGATTGA 420
TTGAAGG 427

(2) INFORMATION FOR SEQ ID NO: 31:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 437 base pairs 

(B) TYPE: nucleic acid 



(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: cDNA to mRNA 

(iii) HYPOTHETICAL: NO 

(iii) ANTI-SENSE: NO 
(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Arabidopsis thaliana 

(B) STRAIN: ecotype Columbia 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1..437 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 31: 
GTTGTACTAT CATNGAAGAT TAAGTTAGTC GATGTTCCGT CCACGGTCAC CGNGTTTAAA 60
GTCCAGAAAC ATNAGGAGAA AGAGGCCGTT AGGNTCATCA ACAAGTGGCA GTATGTTGCG 120
GATAAGGTCC CTGAAGATCT TTTCATCAGC GCAACGTTGG NGAGATCAAA CGGAAACTCT 180
GTGCAGGCTT TGTTTACTGG ACTCTATCTT GGNCCGGTGA ATAATNTCTT GGCCTTGATG 240
GAAGAAAAGT TTCCAGANTT AGGTCTTGAT ATCCAAGNCT GCACAGAGAT GAGTTGGGCT 300
GAATCTGCAC TCTGGTNTNC TGNTTTCNCT AAAGGAGAGN CTCCTTGGGT GTTCCNCGCG 360
GATCGGNAGC GGNCAATTTN TGGNCTTTCA AGGGGAAAGN CGGCTTTTTN CAAGAACCCG 420
NTACCCGGGG TTCAATT 437

(2) INFORMATION FOR SEQ ID NO: 32: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 441 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: cDNA to mRNA 

(iii) HYPOTHETICAL: NO 

(iii) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Arabidopsis thaliana 

(ix) FEATURE: 

(A) NAME/KEY: CDS

(B) LOCATION: 1..441 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 32: 
GCGGACCCTA TAGATCANNA TGTGCTACTG ANAGAAGAGG AAGCCAAGAA CAAGCCGGAG 60 
ACAGATAAAT ATCTGAAATG GGNCGATANC GTTTACGAAT TTATGACNCC ATATGTTTCG 120 



AAATCTCCAA GAGGAGCTTA TGTCAATTTC AAGGATATGG ATTTGGGTAT GTATCTTGGA 180 

AAGAAGAAGA CAAAGTACGA GGAAGGAAAG AGTTGGGGAG TGAAGTATTT CAAGAACAAT 240

TTCGAGAGAT TGGTGAGAGT GAAGACTAGG GTTGATCCAA CAGATTTCTT CTGCGATGAA 300

CAGAGCATTC CTCTGGTGAA CAAAGTTACC TGAAGATATC ATTTGAAGTT TTTTATTAGT 360

CCCTTTTCTC TGTGAAATCA TCTGTGCGTG TTGAATATTA TGCGTCAAGT GTGTAACTTA 420

TGTGTGTGAT TGTGAATTGT G 441 

(2) INFORMATION FOR SEQ ID NO: 33: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 502 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA to mRNA 
(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Arabidopsis thaliana 

(B) STRAIN: ecotype Columbia 

(ix) FEATURE: 

(A) NAME/KEY: CDS

(B) LOCATION: 2..502

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 33: 
CTGGCTTAAC ACAACGTCGT TTTGGGCCAA TTACCCGGCG GGTACACCCA AGAGCATCCT 60
TCTAGATAGG CCTCCGACGA ATTCAGTGTC ATTTAAGAGT AAATCGGATT TTGTCAAAAA 120
ACCAATACCC AAAAAAGGTT TAGAGAAGCT TTGGAAGACA ATGTTTAAAT TCAACAGTAG 180
CGTCTCGTTG CAATTCAACC CTTACGGTGG AGTGATGGAC CGGATTCCGG CAACGGCCAC 240
CGCTTTTCCT CATCGGAAAG GAAACTTGTT CAAGGTTCAA TACNCTACGA TGTGGTTTGA 300
CGCAAACGCC ACACAGAGTA GCCNGGCTAT GATGAATGAG CTTTTTGAGG TGGCGGGACC 360
GTACGTGNGT CAAGTAAACC CGAGANANGG CTTCCTTTAA NTTCAGAGNC CATCGNTNTT 420
NGGAGCAANN CCAAGTGGGG GGGNCCAACC GGGGGNTNAA ANCNNAGNTC TTNGGGGGCC 480
CAGAATTTCC TTNGGGGAAT TT 502

(2) INFORMATION FOR SEQ ID NO: 34: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 400 base pairs 

(B) TYPE: nucleic acid 



(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA to mRNA 
(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO

(vi) ORIGINAL SOURCE:

(A) ORGANISM: Arabidopsis thaliana 

(B) STRAIN: ecotype Columbia 

(ix) FEATURE: 

(A) NAME/KEY: CDS

(B) LOCATION: 2..400

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 34:
NGGGAATTGC NCGAGGNAAG TTGTACCCAA TTCCTGGACC ACCATTGGTT TCCCAAGAAN 60 

CCCGAGACAA CCGTTTTTCA ATNACCGTGA TGTTGATTTG GGTATTAATT CTCATAATGG 120

TAAAATCAGT AGTTATGTGG AAGGTAAACG TTACGGGAAG AAGTATTTCG CAGGTAATTT 180

CGAGAGATTG GTGAAGATTA AGACGAGAGT TGATAGTGGT AATTTCTTTA GGAACGAACA 240

GAGTATTCCT GTGTTACCAT AAGTGTATTT ATTTGATTAT TGGTTAGTGA AATTTGTTGT 300

TGTATAATGA TTATATGTCG TATTTTTATT TATTATTAGT AATTTATAAA GTTTGATATT 360

AAATACAAAT AGTATAATAA GATAGTTTCT TTTAGTAAAA 400

(2) INFORMATION FOR SEQ ID NO: 35:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 383 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA to mRNA 
(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Arabidopsis thaliana 

(B) STRAIN: ecotype Columbia 

(ix) FEATURE: 

(A) NAME/KEY: CDS

(B) LOCATION: 2..383

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 35: 
CAACTCTAAT GGGAACACCT ACTTCGATCG AATGTCGATG GGGGAAGAGC TTTTCTGGGC 60 
GGTTCGAGGA GGTGGAGCCG CGAGTTTCGG CATCGTGATG GGATACAAAA TCCGGTTGGT 120



TCCGGTTCCG GAGAAAGTTA CGGTTTTTAG CGTCGGAAAA ACCGTCGGAG AAGGAGCCGT 180 



TGATCTTATA ATGAAGTGGC AGAACTTCTC TCATAGTACG GNTCGGAATT TNTTTGTGAA 240 

GCTGANTTTT GANTTTAGTC AACGGTGCAA AGCCGGGTGA AAAAAAGGTT TTAGNGNCTT 300

TCANTTTGGN TGNAANCTTG GGGGTTTTAT NAGAACGGTT AACCGGGATT NANCCCGNGT 360 

TTTCCCGGGG TTAAAACCTT NGG 383 

(2) INFORMATION FOR SEQ ID NO: 36:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 354 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA to mRNA 
(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO

(vi) ORIGINAL SOURCE:

(A) ORGANISM: Arabidopsis thaliana 

(B) STRAIN: ecotype Columbia 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1..354 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 36: 
ATCAATGTTC TTACTAAACG TACACGAGCA TCGTTGGCTT TCAAGGCTAA ATCTGATTTT 60
NTTCAAGAAC CGATNCCTAA AACCGCGATT TCGAAGCTTT GGAGACGGTT GCAAGAACCG 120
GAAGCAGAGC ATGCTCAGCT AATTTNCACN CCATTTGGTG GTAAAATGAG TNAGATTGCA 180
GATTACGAAA CACCATTTCC GCATAGGAAG GGGAATATAT ATNAGATTCA GTACTTGAAT 240
TACTGGAGAG GAGACGTGAA AGAGAAGTAT ATTGAGATNG GTGGAGGAGA GTTTACGGTT 300
GNTATNAGTA AGTTTTTTGG CGAAGTNTNC CNAGAGGNGN CTTNNTNTAA ACCT 354

(2) INFORMATION FOR SEQ ID NO: 37: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 403 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA to mRNA 
(iii) HYPOTHETICAL: NO 



(iii) ANTI-SENSE: NO



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Arabidopsis thaliana

(B) STRAIN: ecotype Columbia 

(ix) FEATURE: 

(A) NAME/KEY: CDS

(B) LOCATION: 2..403

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 37: 
TTTTTTAGTA CACTAATAAT CAAATGGAAT GAGAAATGAA GCCACAAAAG TATCTGCAAT 60
CAAAATATCC TGCTATCTCC ATCTCAAGCT CTCAATAGTA TCCTCTCCGA AAGTGAAATC 120
AACATTTCAA ACTCTATTTC TTGGTGGAAT CGATAGACTG ATTCCTCTGA TGAACCAGAA 180
GTTTCCGGAA CTCGGCTTAC GATCTCAAGA CTGTTCGGAA ATGAGCTGGA TCGAATCGAT 240
AATGTTCTTC AACTGGAGAT CAGGACAGCC GTTAGAGATT TTGCTCAACA GAGACCTAAG 300
GATTCGAGGA TCAGTATTTC AAAGCAAAGT CAGGATTATG GTTCAAAAAC CCGTTCCTGA 360
AAACGTTTTT CGAAGAGGTA TCCAAGGGGT TTCTCGAGCA AGT 403

(2) INFORMATION FOR SEQ ID NO: 38:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 260 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA to mRNA 
(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Arabidopsis thaliana 

(B) STRAIN: ecotype Columbia 
(ix) FEATURE: 

(A) NAME/KEY: CDS

(B) LOCATION: 1..260 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 38: 
GAGATGAGTT GGATTAANTC TGTACTCTGG TTTGCTGATT TCCCTAAAGG AGAATCTCTT 60 
NGTGTTCTCA CGAATCGTAA GCGTACATCT CTATCTTTNA AAGGCAAAGA TGATTTTATC 120
CAAGAACCGA TACCCGAGGC TGCAATTNAA GAGATATGGA GGCGATTAGA AGCCCCCNAG 180
GCTCGGCTTG GAAAGATCAT ATTAACTCCA TTTGGTGGGA AAATNAGTGA AATGGCAGAG 240
TACGTANCAC CATTCCCACA 260



(2) INFORMATION FOR SEQ ID NO: 39:



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 605 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA to mRNA
(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Arabidopsis thaliana 

(B) STRAIN: ecotype Columbia 

(ix) FEATURE: 

(A) NAME/KEY: CDS

(B) LOCATION: 2..605

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 39:
CTCTTGCATA TTCGCTGCAA GGATGGGAAA TTCAAAACCA CTCCCTACAA TTTTTTGTAT 60 
TATAGTTTCA GTCTTGTATT TTTAATTCTA TTGCATAACA CCAACTTCTT CATCAGCCTC 120
CATCCAAGAT CAATTCATAA ACTGTGTCAA AAGAAACACA CATGTTTCTT TTCCACTCGA 180
GAAAACGTTA TTCACCCCTG CGAAAAACGT CTCTTTGTTC AACCAAGTCC TTGANTCGAC 240
GGCTCAAAAT CTCCAGTTCT TGGCAAAATC CATGCCTAAA CCGGGRTTCA TATTCAGACC 300
GATTCACCAG TCTCAAGTCC AAGSTTCCAT CATTTGTTCA AMGRAACTCG GGNTTCATTT 360
TNGTGTTTGA NGTGGCGGTC ACGATTTTCG AGGCCTTTGT NTTTATGTTT CACGGTTTGA 420
AAAAACCGTT TATATTACTC GGCCTGTCAA ANTTGNANNC AAAATCANAT GTTGGATATT 480
GNATTCCAAA TAGGTNCTTG GGGTNAACCT GGTGGCTANC GTTTGGTGAG CTTTTACTTT 540
CAAGAATTTG CANGNGGANG TGCAAAGATT CCATGGGATT TCCCGGGGGG TTTNTTGCAC 600
AATGT 605

(2) INFORMATION FOR SEQ ID NO: 40: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 464 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: cDNA to mRNA

(iii) HYPOTHETICAL: NO 

(iii) ANTI-SENSE: NO 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Arabidopsis thaliana 



(B) STRAIN: ecotype Columbia 

(ix) FEATURE: 

(A) NAME/KEY: CDS

(B) LOCATION: 2..464

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 40:
AACACAAAAC TCTTCCATTT GGCTTCTCTC TTGCATATTC GTTGCAAGGA TGGGAAATTC 60
AAAACCACTC CCTACAATTN CTTGTATTAT CGTTTCAGTC TTGTATTTTN NATTCTATTG 120
CATAACACCA ACTTCTTCAT CAGCCTCCAT CCAAGNTCAA TTCATAAACT GTGTCAAAAG 180
GAACACACAT GTTTCTTTTC CACTCGAGNA AACGGTATTC ACTCCTGCGG AAAACGGCTC 240
TNTTATTCAA CGGGTCCNTG AATCGACGGG TCAAAATCTC CAGTTCTTGG NAAAATCCAT 300
GNCTAAACCG GGGTTCATAT TCAGGCCGGT TCACCAGTCT CAAGTCCAAG NTTCCATCAT 360
TTGTTCAAAG GAACTCGGGA TTCATTTCCG CGNTAGAAGT GGCGGGCANN GGTTTCGGGG 420
CCTGTCTNTT GNTTANGGGN AGGAAAACCG GTTNTATTNC TCGG 464 

(2) INFORMATION FOR SEQ ID NO: 41: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 386 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA to mRNA 
(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Arabidopsis thaliana 

(B) STRAIN: ecotype Columbia 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1..386 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 41:
TCGGGAGCCC ANGNTAAATT ANNTGAAAAT GGGGNCGNAT ANCCGTTTAC NGAATTTTAT 60 
GACNCCCAAT ATGTTTCGAA ATCTCAAAGA NNGGGANCTT ATGTCAATTT CAAGGATATG 120 
GATTTGGGTA TGTATCTTGG AAAGNAGAAG ACAAAGTACG AGGAAGGAAA GAGTTGGGGA 180 
GTGAAGTATT TCAAGAACAA TTTCGAGAGA TTGGTGAGAG TGAAGACTAG GGTTGATCCN 240 
ACAGATTTCN TCTGCGATGA ACAGAGCATT CCTCTGGTGN ACAAAGTTAC CTGAAGATAT 300 
CATTTGAAGT TTTTTATTAG TCCCTTTTCT CTGTGAAATC ATCTGTGCGT GTTGAATANT 360 



ATGCGTCAAG TGTGTAACTT ATGTGT 386

(2) INFORMATION FOR SEQ ID NO: 42: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 377 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: cDNA to mRNA 
(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Arabidopsis thaliana 

(B) STRAIN: ecotype Columbia

(ix) FEATURE: 

(A) NAME/KEY: CDS

(B) LOCATION: 1..377 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 42: 
TACCATAGGG AGGTGGTGNA AGATTTTGTA TGTAGNCTTA GGGGAAGGCG AGTAGTATGG 60 
TGGTGGTGGG GAGCTGTAAA CGTATGGTGG TGGTGGAGAT TTGTATGTGG GCTGGTTAAC 120 
TTCATTGAAG CTAAAATCTG GGGACCTAAG TACTTCAAAG GCAATTTTGA CAGATTGGTG 180 
AAGATTAAAA CCAAGGTTGA TCCAGAGAAC TTCTTCAGGC ACGAGCAGAG TATCCCACCT 240 
ATGCCCTACT AGAAGCTAGG TTCATGAAAC CAATAACATT ATCAAAAATA AGAATAAATG 300
ATAATTGTAT ACAACATGAT TCGTCTTTCT TTATTTCAGA CAATGTGGAC ACTACTCTAA 360
ATAAAATGTC ATTTACC 377

(2) INFORMATION FOR SEQ ID NO: 43: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 377 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: cDNA to mRNA 
(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Arabidopsis thaliana 

(B) STRAIN: ecotype Columbia 

(ix) FEATURE: 



(A) NAME/KEY: CDS 

(B) LOCATION: 1..377 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 43: 

TACCATAGGG AGGTGGTGNA AGATTTTGTA TGTAGNCTTA GGGGAAGGCG AGTAGTATGG 60

TGGTGGTGGG GAGCTGTAAA CGTATGGTGG TGGTGGAGAT TTGTATGTGG GCTGGTTAAC 120

TTCATTGAAG CTAAAATCTG GGGACCTAAG TACTTCAAAG GCAATTTTGA CAGATTGGTG 180

AAGATTAAAA CCAAGGTTGA TCCAGAGAAC TTCTTCAGGC ACGAGCAGAG TATCCCACCT 240

ATGCCCTACT AGAAGCTAGG TTCATGAAAC CAATAACATT ATCAAAAATA AGAATAAATG 300

ATAATTGTAT ACAACATGAT TCGTCTTTCT TTATTTCAGA CAATGTGGAC ACTACTCTAA 360

ATAAAATGTC ATTTACC 377



(2) INFORMATION FOR SEQ ID NO: 44: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 346 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA to mRNA 
(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Arabidopsis thaliana 

(B) STRAIN: ecotype Columbia 

(ix) FEATURE: 

(A) NAME/KEY: CDS
(B) LOCATION: 2..346

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 44:

GAGCTGTGGA TATGGTCACA AATGGCAATC GGTTGGTCCG AAAACTGATC CGAATCTTTT 60 

TATGAGAATN TTGATTCAAC CAGTGACGAG GAAGAAGGTA AAGACTGTGA GAGCTTCTNT 120 

GGTTGCCCTN TTTTNAGGCN AGACAGATGA AGTTTTTGCT TTCCTTAGTA AGGAGTTTCC 180 

TGAATTGGGT TTAAAGAAGG AGAATTNTTC GGAGATGACT TGGTTTCANT CTGCTTTATG 240 

GTGGGACAAT CGTCTTAATG CTACTCAGGT TGATCCTAAA GTNTTTCTTG ATCGGAATCT 300 

CGATACCTCG AGTTTCGGTA AGAGGAAATC GGATTACGTC GCGACT 346 

(2) INFORMATION FOR SEQ ID NO: 45: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 261 base pairs 



(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA to mRNA 
(iii) HYPOTHETICAL: NO
(iii) ANTI-SENSE: NO

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Arabidopsis thaliana 

(B) STRAIN: ecotype Columbia 

(ix) FEATURE: 

(A) NAME/KEY: CDS

(B) LOCATION: 2..261

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 45: 
ATGGGGTGAG ACTTATTTCA AAGGTAATTT CAAGAGATTA GGTTTGGTTA AAGGGAAGNT 60
TGATCCAACA AATTTCTTCA GGAACGAACA GAGTATTCCT CCTCTGTTTT GAGTCCTCAA 120
TACAAAACCA GATATAAAAG ATGTCATTTC ATTTTTTCAA TTATAATAGA TAATGTAACT 180
TTCTGCTACA ATTGTAAAAG TGAGATGTAC CCAATACGGT TTAAGCGGAC CGAGAATAGT 240
CAATTCAAAG ACCAAATTCT G 261

(2) INFORMATION FOR SEQ ID NO: 46: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 478 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA to mRNA 
(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Arabidopsis thaliana 

(B) STRAIN: ecotype Columbia 

(ix) FEATURE: 

(A) NAME/KEY: CDS

(B) LOCATION: 1..478 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 46:
GCTCAAAGGA CTAACCATGA AAACTTCCTC AAGTGTCTCT CTCACCGANT CAACGAGGAC 60 
GACTCAAGAN TTATACACAC ATCAAAAGAT CCTTCGTATT TNTCAATCTT GATTTCTTCC 120 
ATACAAAATC CAAGTTTCTC TGTTCTCGAA ACACCTAAAC CGGTTTCAAT CATCACTCCG 180



GTTCAAGCCA CCGATGTTCA ATCTACGNTT AAATNCGCAC GGNCTTCACG GGTATACACA 240 
ATCAGGGCTA GGAGTGGTNG TCATGACTAC GGAGGTTTAT CTTTACATTG GCTTAAAAAN 300
CANNCCGTTC GTTNNTCATT GATTTNNAGA AATCTTCCGG GCTTATTTAA CATNTAAGAT 360
GTTTGATAAN CCGGNNCCNG TTTGGGGTTC AAATCCCGGT GGCTTACAAA NTTNGGGGGA 420
ATTGTNCCTA TGAGGTTTGG AAAATTAANG CAAAATNTTT TGGGCCTTCC CGGCCGGT 478 

(2) INFORMATION FOR SEQ ID NO: 47: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 579 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA to mRNA 
(iii) HYPOTHETICAL: NO
(iii) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Arabidopsis thaliana 

(B) STRAIN: ecotype Columbia 

(ix) FEATURE: 

(A) NAME/KEY: CDS

(B) LOCATION: 2..579

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 47: 
GGCCGTTAGG ATCATCAAGA AATGGCAATA TGCTGCAGAT AAGGTTCCTG ATGATCTTTT 60
CATTAGGACA ACATTGGAGA GATCAAACAA GAACGCAGTA CACGCTTTGT TCACTGGACT 120
ATATATTGGT CCGGTGAACA ATCTATTGGC GTTGATGGAA GAAAAGTTTC CGGAACTAGG 180
TCTTGAGAAA GAAGGTTGTG AAGAGATGAG TTGGATTGAG TCTGTACTCT GGTTTGCTGA 240
TTTCCCTAAA GGAGAATCTC TTGGTGTTCT CACGAATCGT GAGCGTACAT CTCTATCTTT 300
CAAAGGCAAA GATGATTTTG TCCAAGAACC GATACCCGAG GCTGCAATTC AAGAGATATG 360
GAGGCGATTA GAAGCCCCCG AGGCTCGGCT TGGAAAGATC ATATTAACTC CATTTGGGTG 420
NGGNAAAATG AGTGAAATGG CAGAGNCCGA ACCACCAATT CCCACANNCG AGGGAGGGGA 480
ACCCCTNTGN GGNTCAGAAT GTGGTTCCTG GNNNNNAAGN GGGNGCCAGN ACCAANCCGG 540
GNCNGTAAAN CNTGNAATGG GCCNAACCCG TNCCGGATT 579 

(2) INFORMATION FOR SEQ ID NO: 48:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 252 base pairs

(B) TYPE: nucleic acid 



(C) STRANDEDNESS: double

(D) TOPOLOGY: linear



(ii) MOLECULE TYPE: cDNA to mRNA 
(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE:

(A) ORGANISM: Oryza sativa 

(B) STRAIN: Nipponbare, subsp. japonica

(D) DEVELOPMENTAL STAGE: etiolated shoot (8 days old) 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 3..252

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 48: 
TGTCCTGGAA GGTCCGCCTC GTGCAGGTTN CGACGACGGT GACGGTGTTC GTCGTCGGGA 60
GGAACGTCGA CCAGGGCGCC GCNGACGTCG TCGCCAGATG GCAAGACGTC GCGCCGAGCC 120
TCCCTCCCGA GCTCACCATA CGGGTGATCG TNCGAGGGCA GCGCGCCACG TTCCAGTCGC 180
TGTACCTCGG CTCGTGCGCC GACCTGGTGC CGACGATGAG CAGCATGTTC CCGGAGCTCG 240
GGATGACGAT TG 252 



(2) INFORMATION FOR SEQ ID NO: 49:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(iii) HYPOTHETICAL: NO 

(vi) ORIGINAL SOURCE:

(A) ORGANISM: Lactuca sativa 

(B) STRAIN: lollo bionda 

(ix) FEATURE: 

(A) NAME/KEY: Modified-site

(B) LOCATION: 12 

(D) OTHER INFORMATION: /label= Ambiguous 
/note= "Xaa = Cys or Ser"

(ix) FEATURE: 

(A) NAME/KEY: Modified-site

(B) LOCATION: 20..21

(D) OTHER INFORMATION: /label= ambiguous 

/note= "Xaa-Xaa probably is Ser-Phe" 




(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 49: 

Thr Ser Thr Ser Ile Ile Asp Arg Phe Thr Gln Xaa Leu Asn Asn Arg
1 5 10 15

Ala Asp Pro Xaa Xaa 
20 



(2) INFORMATION FOR SEQ ID NO: 50: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(iii) HYPOTHETICAL: NO 

(vi) ORIGINAL SOURCE:

(A) ORGANISM: Lactuca sativa 

(B) STRAIN: lollo bionda 

(ix) FEATURE: 

(A) NAME/KEY: Modified-site

(B) LOCATION: 1 

(D) OTHER INFORMATION: /label= ambiguous 
/note= "Xaa = probably Ser" 

(ix) FEATURE: 

(A) NAME/KEY: Modified-site

(B) LOCATION: 3 

(D) OTHER INFORMATION: /label= unknown 

(ix) FEATURE: 

(A) NAME/KEY: Modified-site

(B) LOCATION: 5 

(D) OTHER INFORMATION: /label= ambiguous 
/note= "Xaa = probably Ser" 

(ix) FEATURE: 

(A) NAME/KEY: Modified-site

(B) LOCATION: 12 

(D) OTHER INFORMATION: /label= ambiguous 
/note= "Xaa = probably Trp" 

(ix) FEATURE: 

(A) NAME/KEY: Modified-site

(B) LOCATION: 24 

(D) OTHER INFORMATION: /label= ambiguous 
/note= "Xaa = probably Tyr" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 50: 

Xaa Ile Xaa Val Xaa Ile Glu Asp Glu Thr Ala Xaa Val Gln Ala Gly
1 5 10 15


Ala Thr Leu Gly Glu Val Tyr Xaa 
20 



(2) INFORMATION FOR SEQ ID NO: 51:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein 
(iii) HYPOTHETICAL: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Lactuca sativa 

(B) STRAIN: lollo bionda 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 51: 

Ala Asp Pro Ser Phe Pro Leu Ser Gly Gln Leu Tyr Tyr Pro
1 5 10



(2) INFORMATION FOR SEQ ID NO: 52: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 32 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA
(iii) HYPOTHETICAL: NO 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 52: 
ACTTCTACTT CTATTATTGA TAGGTTTACT CA 



(2) INFORMATION FOR SEQ ID NO: 53: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 405 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA to mRNA
(iii) HYPOTHETICAL: NO
(iii) ANTI-SENSE: NO

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Lactuca sativa 

(B) STRAIN: lollo bionda 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1..405



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 53: 



ACT TCT ACT TCT ATT ATT GAT AGG TTT ACT CAA TGT CTA AAC AAC CGA 48
Thr Ser Thr Ser Ile Ile Asp Arg Phe Thr Gln Cys Leu Asn Asn Arg
1 5 10 15

GCT GAC CCT TCT TTC CCG CTC AGT GGA CAA CTT TAC ACT CCC GAT AAC 96
Ala Asp Pro Ser Phe Pro Leu Ser Gly Gln Leu Tyr Thr Pro Asp Asn
20 25 30

TCC TCT TTT CCA TCC GTC TTG CAA GCT TAC ATC CGG AAC CTC CGA TTC 144
Ser Ser Phe Pro Ser Val Leu Gln Ala Tyr Ile Arg Asn Leu Arg Phe
35 40 45

AAT GAA TCC ACG ACT CCC AAA CCC ATC TTA ATC ATC ACC GCC TTA CAC 192
Asn Glu Ser Thr Thr Pro Lys Pro Ile Leu Ile Ile Thr Ala Leu His
50 55 60

CCT TCA CAC ATT CAA GCA GCT GTT GTG TGC GCC AAA ACA CAC CGC CTG 240
Pro Ser His Ile Gln Ala Ala Val Val Cys Ala Lys Thr His Arg Leu
65 70 75 80

CTA ATG AAA ACC AGA AGC GGA GGC CAT GAT TAT GAG GGG CTT TCC TAT 288
Leu Met Lys Thr Arg Ser Gly Gly His Asp Tyr Glu Gly Leu Ser Tyr
85 90 95

GTG ACC AAT TCG AAC CAA CCC TTT TTT GTT GTT GAC ATG TTC AAC TTA 336
Val Thr Asn Ser Asn Gln Pro Phe Phe Val Val Asp Met Phe Asn Leu
100 105 110

CGC TCC ATA AAC GTG AGT ATT GAA GAT GAA ACT GCA TGG GTC CAA GCC 384
Arg Ser Ile Asn Val Ser Ile Glu Asp Glu Thr Ala Trp Val Gln Ala
115 120 125

GGC GCC ACC CTC GGA GAA GTT 405
Gly Ala Thr Leu Gly Glu Val
130 135



(2) INFORMATION FOR SEQ ID NO: 54: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 135 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 54: 

Thr Ser Thr Ser Ile Ile Asp Arg Phe Thr Gln Cys Leu Asn Asn Arg
1 5 10 15

Ala Asp Pro Ser Phe Pro Leu Ser Gly Gln Leu Tyr Thr Pro Asp Asn
20 25 30

Ser Ser Phe Pro Ser Val Leu Gln Ala Tyr Ile Arg Asn Leu Arg Phe
35 40 45

Asn Glu Ser Thr Thr Pro Lys Pro Ile Leu Ile Ile Thr Ala Leu His
50 55 60 






Pro Ser His Ile Gln Ala Ala Val Val Cys Ala Lys Thr His Arg Leu
65 70 75 80



Leu Met Lys Thr Arg Ser Gly Gly His Asp Tyr Glu Gly Leu Ser Tyr 
85 90 95 

Val Thr Asn Ser Asn Gln Pro Phe Phe Val Val Asp Met Phe Asn Leu
100 105 110 

Arg Ser Ile Asn Val Ser Ile Glu Asp Glu Thr Ala Trp Val Gln Ala
115 120 125 

Gly Ala Thr Leu Gly Glu Val 
130 135 



(2) INFORMATION FOR SEQ ID NO: 55: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 55: 
CACGTTTATG GAGCGTAAGT TGAAC 



(2) INFORMATION FOR SEQ ID NO: 56: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 56: 
CACCCTTCAC ACATTCAAGC AGC 



(2) INFORMATION FOR SEQ ID NO: 57: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1981 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA to mRNA 
(iii) HYPOTHETICAL: NO 



(iii) ANTI-SENSE: NO 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Lactuca sativa 

(B) STRAIN: lollo bionda 



(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 7..1626

(ix) FEATURE:

(A) NAME/KEY: unsure

(B) LOCATION: 372

(D) OTHER INFORMATION: location 372 may be "C" or "G" 



(ix) FEATURE: 

(A) NAME/KEY: unsure 
(B) LOCATION: 379
(D) OTHER INFORMATION: location 379 may be "A" or "G" 



(ix) FEATURE: 

(A) NAME/KEY: unsure

(B) LOCATION: 786 

(D) OTHER INFORMATION: location 786 may be "C" or "T" 



(ix) FEATURE: 

(A) NAME/KEY: unsure

(B) LOCATION: 1105..1106

(D) OTHER INFORMATION: location 1105..1106 may be "AG", "GA", "GG" or "AA"



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 57: 

ACAAAA ATG GCA ATT ACC TAT TCT TTC AAC TTC AAA TCT TAT ATT TTT 48
Met Ala Ile Thr Tyr Ser Phe Asn Phe Lys Ser Tyr Ile Phe
1 5 10

CCT CTC CTC CTT GTC TTG CTC TCT ACC CAT TCA TCA GCG ACT TCA ACT 96
Pro Leu Leu Leu Val Leu Leu Ser Thr His Ser Ser Ala Thr Ser Thr
15 20 25 30

TCC ATT ATA GAT CGC TTC ACC CAA TGT CTA AAC AAC CGA GCT GAC CCT 144
Ser Ile Ile Asp Arg Phe Thr Gln Cys Leu Asn Asn Arg Ala Asp Pro
35 40 45

TCT TTC CCG CTC AGT GGA CAA CTT TAC ACT CCC GAT AAC TCC TCT TTT 192
Ser Phe Pro Leu Ser Gly Gln Leu Tyr Thr Pro Asp Asn Ser Ser Phe
50 55 60

CCA TCC GTC TTG CAA GCT TAC ATC CGG AAC CTC CGA TTC AAT GAA TCC 240
Pro Ser Val Leu Gln Ala Tyr Ile Arg Asn Leu Arg Phe Asn Glu Ser
65 70 75

ACG ACT CCC AAA CCC ATC TTA ATC ATC ACC GCC TTA CAC CCT TCA CAC 288
Thr Thr Pro Lys Pro Ile Leu Ile Ile Thr Ala Leu His Pro Ser His
80 85 90

ATT CAA GCA GCT GTT GTG TGC GCC AAA ACA CAC CGC CTG CTA ATG AAA 336
Ile Gln Ala Ala Val Val Cys Ala Lys Thr His Arg Leu Leu Met Lys
95 100 105 110



ACC AGA AGC GGA GGC CAT GAT TAT GAG GGG CTT TCS TAT GTG RCC AAT 384
Thr Arg Ser Gly Gly His Asp Tyr Glu Gly Leu Ser Tyr Val Xaa Asn
115 120 125

TCG AAC CAA CCC TTT TTT GTT GTT GAC ATG TTC AAC TTA CGC TCC ATA 432
Ser Asn Gln Pro Phe Phe Val Val Asp Met Phe Asn Leu Arg Ser Ile
130 135 140

AAC GTG AGT ATT GAA GAT GAA ACT GCA TGG GTC CAA GCT GGT GCG ACT 480
Asn Val Ser Ile Glu Asp Glu Thr Ala Trp Val Gln Ala Gly Ala Thr
145 150 155

CTT GGT GAA GTC TAC TAC CGA ATA GCA GAG AAA AGC AAC AGT CAT GCT 528
Leu Gly Glu Val Tyr Tyr Arg Ile Ala Glu Lys Ser Asn Ser His Ala
160 165 170

TTT CCG GCT GGC GTT TGC CCT ACT GTT GGA GTT GGT GGC CAT TTT AGT 576
Phe Pro Ala Gly Val Cys Pro Thr Val Gly Val Gly Gly His Phe Ser
175 180 185 190

GGT GGT GGT TAT GGT AAC TTG ATG GGA AAA TAC GGC CTT TCT GTT GAC 624
Gly Gly Gly Tyr Gly Asn Leu Met Gly Lys Tyr Gly Leu Ser Val Asp
195 200 205

AAT ATT GTC GAT GCT CAG TTA ATC GAT GTG AAT GGT AAA CTT CTG AAT 672
Asn Ile Val Asp Ala Gln Leu Ile Asp Val Asn Gly Lys Leu Leu Asn
210 215 220

CGG AAA TCA ATG GGT GAA GAT CTT TTT TGG GCC ATC ACA GGT GGT GGT 720
Arg Lys Ser Met Gly Glu Asp Leu Phe Trp Ala Ile Thr Gly Gly Gly
225 230 235

GGT GTC AGC TTT GGT GTG GTT GTA GCG TAC AAG ATC AAA CTG GTT CGT 768
Gly Val Ser Phe Gly Val Val Val Ala Tyr Lys Ile Lys Leu Val Arg
240 245 250

GTT CCT ACC ACT GTG ACY GTT TTT AAC GTA CAA AGA ACA TCC GAG CAG 816
Val Pro Thr Thr Val Thr Val Phe Asn Val Gln Arg Thr Ser Glu Gln
255 260 265 270

AAC CTA AGC ACC ATA GCC CAC CGA TGG ATA CAA GTT GCG GAT AAG CTC 864
Asn Leu Ser Thr Ile Ala His Arg Trp Ile Gln Val Ala Asp Lys Leu
275 280 285

GAT AAT GAC CTT TTC CTT CGA ATG ACC TTT AAC GTG ATA AAC AAC ACA 912
Asp Asn Asp Leu Phe Leu Arg Met Thr Phe Asn Val Ile Asn Asn Thr
290 295 300

AAT GGC GAA AAG ACG ATA CGT GGT TTG TTT CCA ACA CTG TAC CTC GGA 960
Asn Gly Glu Lys Thr Ile Arg Gly Leu Phe Pro Thr Leu Tyr Leu Gly
305 310 315

AAC TCT ACC GCT CTT GTT GCC CTC CTG AAC AAG GAT TTC CCT GAA TTA 1008
Asn Ser Thr Ala Leu Val Ala Leu Leu Asn Lys Asp Phe Pro Glu Leu
320 325 330

GGT GTA GAA ATT TCA GAT TGT ATT GAA ATG AGT TGG ATC GAG TCT GTT 1056
Gly Val Glu Ile Ser Asp Cys Ile Glu Met Ser Trp Ile Glu Ser Val
335 340 345 350



CTT TTC TAC ACA AAC TTC CCC ATT GGT ACT CCG ACC ACT GCT CTT CTA 1104
Leu Phe Tyr Thr Asn Phe Pro Ile Gly Thr Pro Thr Thr Ala Leu Leu
355 360 365

RRC CGT ACA CCT CAR AGA CTA AAC CCA TTC AAA ATC AAA TCT GAT TAC 1152
Xaa Arg Thr Pro Gln Arg Leu Asn Pro Phe Lys Ile Lys Ser Asp Tyr
370 375 380

GTA AAA AAC ACT ATT TCC AAA CAG GGA TTC GAA TCC ATA TTT GAA AGG 1200
Val Lys Asn Thr Ile Ser Lys Gln Gly Phe Glu Ser Ile Phe Glu Arg
385 390 395

ATG AAA GAA CTC GAA AAC CAA ATG CTA GCT TTC AAC CCT TAT GGT GGA 1248
Met Lys Glu Leu Glu Asn Gln Met Leu Ala Phe Asn Pro Tyr Gly Gly
400 405 410

AGA ATG AGC GAA ATT TCC GAA TTT GCA AAG CCT TTT CCC CAT CGA TCA 1296
Arg Met Ser Glu Ile Ser Glu Phe Ala Lys Pro Phe Pro His Arg Ser
415 420 425 430

GGG AAT ATA GCG AAG ATC CAA TAC GAA GTA AAC TGG GAT GAA CTT GGC 1344
Gly Asn Ile Ala Lys Ile Gln Tyr Glu Val Asn Trp Asp Glu Leu Gly
435 440 445

GTT GAA GCA GCC AAT CGG TAC TTG AAC TTC ACA AGG GTG ATG TAT GAT 1392 
Val Glu Ala Ala Asn Arg Tyr Leu Asn Phe Thr Arg Val Met Tyr Asp 
450 455 460 

TAT ATG ACT CCG TTT GTT TCT AAG AAC CCC AGG GAA GCA TTT CTG AAC 1440 
Tyr Met Thr Pro Phe Val Ser Lys Asn Pro Arg Glu Ala Phe Leu Asn 
465 470 475 

TAC AGG GAT TTA GAT ATT GGT GTC AAC AGT CAT GGC AAG AAT GCT TAC 1488
Tyr Arg Asp Leu Asp Ile Gly Val Asn Ser His Gly Lys Asn Ala Tyr
480 485 490

GGT GAA GGA ATG GTT TAT GGG CAC AAG TAT TTC AAA GAG ACG AAT TAT 1536
Gly Glu Gly Met Val Tyr Gly His Lys Tyr Phe Lys Glu Thr Asn Tyr
495 500 505 510

AAG AGG CTA ACG ATG GTG AAG ACG AGG GTT GAT CCT AGC AAT TTT TTT 1584 
Lys Arg Leu Thr Met Val Lys Thr Arg Val Asp Pro Ser Asn Phe Phe 
515 520 525 

AGG AAT GAG CAA AGT ATC CCA ACT TTG TCA TCT TCA TGG AAG 1626
Arg Asn Glu Gln Ser Ile Pro Thr Leu Ser Ser Ser Trp Lys
530 535 540 

TAAATTCTAA ATTCACTTGT GAAATTGAAT AAAAGTATGG CTTTTTCAAG GTCATGGTAT 1686 

CCAGATTCAG ATGATATTGA TATAATTTTG ACTTGTATTT ATACAAACAA AATTATATTA 1746

TATTTTTCTG AATTTAGATT TTCCATTCTT TGGAAAAATA TACGAACATT GATGTTGATA 1806

TTTTTAAGAA TTATAGATTT TGAACATTGT GAACAATGAA TAAACCGAGG ACTTCCCTTG 1866 

GGTTTTTTTT ATAAGTATGT AATAGCATGT CTTTAATCAA GATAACCGAT CATTGGATGC 1926 

AATTTATTAT TATAAACCTT ATTTAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAA 1981 



(2) INFORMATION FOR SEQ ID NO: 58: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 540 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 58: 

Met Ala Ile Thr Tyr Ser Phe Asn Phe Lys Ser Tyr Ile Phe Pro Leu
1 5 10 15

Leu Leu Val Leu Leu Ser Thr His Ser Ser Ala Thr Ser Thr Ser Ile
20 25 30

Ile Asp Arg Phe Thr Gln Cys Leu Asn Asn Arg Ala Asp Pro Ser Phe
35 40 45

Pro Leu Ser Gly Gln Leu Tyr Thr Pro Asp Asn Ser Ser Phe Pro Ser
50 55 60

Val Leu Gln Ala Tyr Ile Arg Asn Leu Arg Phe Asn Glu Ser Thr Thr
65 70 75 80

Pro Lys Pro Ile Leu Ile Ile Thr Ala Leu His Pro Ser His Ile Gln
85 90 95

Ala Ala Val Val Cys Ala Lys Thr His Arg Leu Leu Met Lys Thr Arg
100 105 110

Ser Gly Gly His Asp Tyr Glu Gly Leu Ser Tyr Val Thr Asn Ser Asn
115 120 125

Gln Pro Phe Phe Val Val Asp Met Phe Asn Leu Arg Ser Ile Asn Val
130 135 140

Ser Ile Glu Asp Glu Thr Ala Trp Val Gln Ala Gly Ala Thr Leu Gly
145 150 155 160

Glu Val Tyr Tyr Arg Ile Ala Glu Lys Ser Asn Ser His Ala Phe Pro
165 170 175

Ala Gly Val Cys Pro Thr Val Gly Val Gly Gly His Phe Ser Gly Gly
180 185 190

Gly Tyr Gly Asn Leu Met Gly Lys Tyr Gly Leu Ser Val Asp Asn Ile
195 200 205

Val Asp Ala Gln Leu Ile Asp Val Asn Gly Lys Leu Leu Asn Arg Lys
210 215 220

Ser Met Gly Glu Asp Leu Phe Trp Ala Ile Thr Gly Gly Gly Gly Val
225 230 235 240



Ser Phe Gly Val Val Val Ala Tyr Lys Ile Lys Leu Val Arg Val Pro
245 250 255

Thr Thr Val Thr Val Phe Asn Val Gln Arg Thr Ser Glu Gln Asn Leu
260 265 270

Ser Thr Ile Ala His Arg Trp Ile Gln Val Ala Asp Lys Leu Asp Asn
275 280 285

Asp Leu Phe Leu Arg Met Thr Phe Asn Val Ile Asn Asn Thr Asn Gly
290 295 300

Glu Lys Thr Ile Arg Gly Leu Phe Pro Thr Leu Tyr Leu Gly Asn Ser
305 310 315 320

Thr Ala Leu Val Ala Leu Leu Asn Lys Asp Phe Pro Glu Leu Gly Val
325 330 335

Glu Ile Ser Asp Cys Ile Glu Met Ser Trp Ile Glu Ser Val Leu Phe
340 345 350

Tyr Thr Asn Phe Pro Ile Gly Thr Pro Thr Thr Ala Leu Leu Ser Arg
355 360 365

Thr Pro Gln Arg Leu Asn Pro Phe Lys Ile Lys Ser Asp Tyr Val Lys
370 375 380

Asn Thr Ile Ser Lys Gln Gly Phe Glu Ser Ile Phe Glu Arg Met Lys
385 390 395 400

Glu Leu Glu Asn Gln Met Leu Ala Phe Asn Pro Tyr Gly Gly Arg Met
405 410 415

Ser Glu Ile Ser Glu Phe Ala Lys Pro Phe Pro His Arg Ser Gly Asn
420 425 430

Ile Ala Lys Ile Gln Tyr Glu Val Asn Trp Asp Glu Leu Gly Val Glu
435 440 445

Ala Ala Asn Arg Tyr Leu Asn Phe Thr Arg Val Met Tyr Asp Tyr Met 
450 455 460 

Thr Pro Phe Val Ser Lys Asn Pro Arg Glu Ala Phe Leu Asn Tyr Arg 
465 470 475 480 

Asp Leu Asp He Gly Val Asn Ser His Gly Lys Asn Ala Tyr Gly Glu 
485 490 495 

Gly Met Val Tyr Gly His Lys Tyr Phe Lys Glu Thr Asn Tyr Lys Arg 
500 505 510 

Leu Thr Met Val Lys Thr Arg Val Asp Pro Ser Asn Phe Phe Arg Asn 
515 520 525 

Glu Gin Ser He Pro Thr Leu Ser Ser Ser Trp Lys 
530 535 540 



(2) INFORMATION FOR SEQ ID NO : 59: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(iii) HYPOTHETICAL: NO 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 59: 
GGTAATGATC TCCTTTCTTG TTTGACC 



(2) INFORMATION FOR SEQ ID NO: 60: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 41 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 60: 
AGAGCGGCCG CTATATTACA ACTTCTCCAC CATCACTCCT C 



(2) INFORMATION FOR SEQ ID NO : 61: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 61:
GGTGATGTTA ATGATAATCT CCTC 

(2) INFORMATION FOR SEQ ID NO : 62: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 40 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 62:
AGAGCGGCCG CTACAATTCC TTCAACATGT AAATTTCCTC 

(2) INFORMATION FOR SEQ ID NO: 63: 



(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 36 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA
(iii) HYPOTHETICAL: NO
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 63:
ACTTCCCGTA GAAACTCGGA GACTTTCACA CAATGC 



(2) INFORMATION FOR SEQ ID NO : 64: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 64:
TCCATCCAAG ATCAATTCAT AAACTGTGTC 



(2) INFORMATION FOR SEQ ID NO: 65: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 36 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 65:
AGAGCGGCCG CTTTCATGAA CCTAGCTTCT AGTAGG 

(2) INFORMATION FOR SEQ ID NO: 66: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 37 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: cDNA

(iii) HYPOTHETICAL: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 66: 



AGAGCGGCCG CGAAATGGCC CCCCTTTTAA AACGGGG



(2) INFORMATION FOR SEQ ID NO: 67: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 40 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 67: 
AGAGCGGCCG CAAATGATAT CTTCAGGTAA CTTTGTTCAC 



(2) INFORMATION FOR SEQ ID NO: 68:

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 43 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA
(iii) HYPOTHETICAL: NO 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 68: 
AGAGCGGCCG CATAATCAAA TAAATACACT TATGGTAACA CAG 



(2) INFORMATION FOR SEQ ID NO : 69: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 38 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 69: 
AGAGCGGCCG CTGGTTTTGT ATTGAGGACT CAAAACAG 



(2) INFORMATION FOR SEQ ID NO: 70: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 1757 base pairs 

(B) TYPE: nucleic acid 



(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Arabidopsis thaliana 

(B) STRAIN: Columbia

(ix) FEATURE: 

(A) NAME/KEY: CDS

(B) LOCATION: join(1..570, 801..1754)
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 70: 

ACT TCC CGT AGA AAC TCG GAG ACT TTC ACA CAA TGC CTA ACC TCA AAC 48 
Thr Ser Arg Arg Asn Ser Glu Thr Phe Thr Gln Cys Leu Thr Ser Asn
1 5 10 15

TCC GAC CCC AAA CAT CCC ATC TCC CCC GCT ATC TTC TTC TCC GGA AAT 96
Ser Asp Pro Lys His Pro Ile Ser Pro Ala Ile Phe Phe Ser Gly Asn
20 25 30

GGC TCC TAC TCC TCC GTA TTA CAA GCC AAC ATC CGT AAC CTC CGC TTC 144
Gly Ser Tyr Ser Ser Val Leu Gln Ala Asn Ile Arg Asn Leu Arg Phe
35 40 45

AAC ACC ACC TCA ACT CCG AAA CCC TTC CTC ATA ATC GCC GCA ACA CAT 192
Asn Thr Thr Ser Thr Pro Lys Pro Phe Leu Ile Ile Ala Ala Thr His
50 55 60

GAA TCC CAT GTG CAA GCC GCG ATT ACT TGC GGG AAA CGC CAC AAC CTT 240
Glu Ser His Val Gln Ala Ala Ile Thr Cys Gly Lys Arg His Asn Leu
65 70 75 80

CAG ATG AAA ATC AGA AGT GGA GGC CAC GAC TAC GAT GGC TTG TCA TAC 288
Gln Met Lys Ile Arg Ser Gly Gly His Asp Tyr Asp Gly Leu Ser Tyr
85 90 95

GTT ACA TAC TCT GGC AAA CCG TTC TTC GTC CTC GAC ATG TTT AAC CTC 336
Val Thr Tyr Ser Gly Lys Pro Phe Phe Val Leu Asp Met Phe Asn Leu
100 105 110

CGT TCG GTG GAT GTC GAT GTG GCA AGT AAG ACC GCG TGG GTC CAA ACC 384
Arg Ser Val Asp Val Asp Val Ala Ser Lys Thr Ala Trp Val Gln Thr
115 120 125

GGT GCC ATA CTC GGA GAA GTT TAT TAC TAT ATA TGG GAG AAG AGC AAA 432
Gly Ala Ile Leu Gly Glu Val Tyr Tyr Tyr Ile Trp Glu Lys Ser Lys
130 135 140

ACC CTA GCT TAT CCC GCC GGA ATT TGT CCC ACG GTT GGT GTC GGT GGC 480
Thr Leu Ala Tyr Pro Ala Gly Ile Cys Pro Thr Val Gly Val Gly Gly
145 150 155 160

CAT ATC AGT GGT GGA GGT TAC GGT AAC ATG ATG AGA AAA TAC GGT CTC 528
His Ile Ser Gly Gly Gly Tyr Gly Asn Met Met Arg Lys Tyr Gly Leu
165 170 175

ACC GTA GAT AAT ACC ATC GAT GCA AGA ATG GTC GAC GTT AAT 570
Thr Val Asp Asn Thr Ile Asp Ala Arg Met Val Asp Val Asn
180 185 190

GGTATAATTG ATATCTCTAT TTTATATACT AATTAAATTT TATAGTGTGG ATCGGATAGT 630 

GATTTTGGTC CATCAATTAA AAACTTGGTG AACATAAAAT TAACCAAGCA ATCAATTTAG 690 

ACAAGCAACA TAATCATATA TATTTTTCTT ACATTTGTAT GTACCTGAAT ATTTATATTT 750 

ATGTTTATAT GTTCTCACTA TATTTTCACT TTTGTATTTG AAAATTTTTA GGA AAA 806 

Gly Lys 

ATT TTG GAT AGA AAA TTG ATG GGA GAA GAT CTC TAC TGG GCA ATA AAC 854 
Ile Leu Asp Arg Lys Leu Met Gly Glu Asp Leu Tyr Trp Ala Ile Asn
195 200 205

GGA GGA GGA GGA GGG AGC TAC GGC GTC GTA TTG GCC TAC AAA ATA AAC 902
Gly Gly Gly Gly Gly Ser Tyr Gly Val Val Leu Ala Tyr Lys Ile Asn
210 215 220

CTT GTT GAA GTC CCA GAA AAC GTC ACC GTT TTC AGA ATC TCC CGG ACG 950
Leu Val Glu Val Pro Glu Asn Val Thr Val Phe Arg Ile Ser Arg Thr
225 230 235 240

TTA GAA CAA AAT GCG ACG GAT ATC ATT CAC CGG TGG CAA CAA GTT GCA 998
Leu Glu Gln Asn Ala Thr Asp Ile Ile His Arg Trp Gln Gln Val Ala
245 250 255

CCG AAG CTT CCC GAC GAG CTT TTC ATA AGA ACA GTC ATT GAC GTA GTA 1046
Pro Lys Leu Pro Asp Glu Leu Phe Ile Arg Thr Val Ile Asp Val Val
260 265 270

AAC GGC ACT GTT TCA TCT CAA AAG ACC GTC AGG ACA ACA TTC ATA GCA 1094
Asn Gly Thr Val Ser Ser Gln Lys Thr Val Arg Thr Thr Phe Ile Ala
275 280 285

ATG TTT CTA GGA GAC ACG ACA ACT CTA CTG TCG ATA TTA AAC CGG AGA 1142
Met Phe Leu Gly Asp Thr Thr Thr Leu Leu Ser Ile Leu Asn Arg Arg
290 295 300

TTC CCA GAA TTG GGT TTG GTC CGG TCT GAC TGT ACC GAA ACA AGC TGG 1190
Phe Pro Glu Leu Gly Leu Val Arg Ser Asp Cys Thr Glu Thr Ser Trp
305 310 315 320

ATC CAA TCT GTG CTA TTC TGG ACA AAT ATC CAA GTT GGT TCG TCG GAG 1238
Ile Gln Ser Val Leu Phe Trp Thr Asn Ile Gln Val Gly Ser Ser Glu
325 330 335

ACA CTT CTA CTC CAA AGG AAT CAA CCC GTG AAC TAC CTC AAG AGG AAA 1286
Thr Leu Leu Leu Gln Arg Asn Gln Pro Val Asn Tyr Leu Lys Arg Lys
340 345 350

TCA GAT TAC GTA CGT GAA CCG ATT TCA AGA ACC GGT TTA GAG TCA ATT 1334
Ser Asp Tyr Val Arg Glu Pro Ile Ser Arg Thr Gly Leu Glu Ser Ile
355 360 365

TGG AAG AAA ATG ATC GAG CTT GAA ATT CCG ACA ATG GCT TTC AAT CCA 1382
Trp Lys Lys Met Ile Glu Leu Glu Ile Pro Thr Met Ala Phe Asn Pro
370 375 380

TAC GGT GGT GAG ATG GGG AGG ATA TCA TTA CGG GTG ACT CCG TTC CCA 1430
Tyr Gly Gly Glu Met Gly Arg Ile Ser Leu Arg Val Thr Pro Phe Pro
385 390 395 400

TAC AGA GCC GGT AAT CTC TGG AAG ATT CAG TAC GGT GCG AAT TGG AGA 1478
Tyr Arg Ala Gly Asn Leu Trp Lys Ile Gln Tyr Gly Ala Asn Trp Arg
405 410 415

GAT GAG ACT TTA ACC GAC CGG TAC ATG GAA TTG ACG AGG AAG TTG TAC 1526
Asp Glu Thr Leu Thr Asp Arg Tyr Met Glu Leu Thr Arg Lys Leu Tyr
420 425 430

CAA TTC ATG ACA CCA TTT GTT TCC AAG AAT CCG AGA CAA TCG TTT TTC 1574
Gln Phe Met Thr Pro Phe Val Ser Lys Asn Pro Arg Gln Ser Phe Phe
435 440 445

AAT AAC CGT GAT GTT GAT TTG GGT ATT AAT TCT CAT AAT GGT AAA ATC 1622
Asn Asn Arg Asp Val Asp Leu Gly Ile Asn Ser His Asn Gly Lys Ile
450 455 460

AGT AGT TAT GTG GAA GGT AAA CGT TAC GGG AAG AAG TAT TTC GCA GGT 1670
Ser Ser Tyr Val Glu Gly Lys Arg Tyr Gly Lys Lys Tyr Phe Ala Gly
465 470 475 480

AAT TTC GAG AGA TTG GTG AAG ATT AAG ACG AGA GTT GAT AGT GGT AAT 1718
Asn Phe Glu Arg Leu Val Lys Ile Lys Thr Arg Val Asp Ser Gly Asn
485 490 495

TTC TTT AGG AAC GAA CAC AGT ATT CCT GTG TTA CCA TAA 1757
Phe Phe Arg Asn Glu His Ser Ile Pro Val Leu Pro
500 505



(2) INFORMATION FOR SEQ ID NO: 71: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 508 amino acids 

(B) TYPE: amino acid
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 71: 

Thr Ser Arg Arg Asn Ser Glu Thr Phe Thr Gln Cys Leu Thr Ser Asn
1 5 10 15

Ser Asp Pro Lys His Pro Ile Ser Pro Ala Ile Phe Phe Ser Gly Asn
20 25 30

Gly Ser Tyr Ser Ser Val Leu Gln Ala Asn Ile Arg Asn Leu Arg Phe
35 40 45

Asn Thr Thr Ser Thr Pro Lys Pro Phe Leu Ile Ile Ala Ala Thr His
50 55 60

Glu Ser His Val Gln Ala Ala Ile Thr Cys Gly Lys Arg His Asn Leu
65 70 75 80

Gln Met Lys Ile Arg Ser Gly Gly His Asp Tyr Asp Gly Leu Ser Tyr
85 90 95

Val Thr Tyr Ser Gly Lys Pro Phe Phe Val Leu Asp Met Phe Asn Leu
100 105 110

Arg Ser Val Asp Val Asp Val Ala Ser Lys Thr Ala Trp Val Gln Thr
115 120 125

Gly Ala Ile Leu Gly Glu Val Tyr Tyr Tyr Ile Trp Glu Lys Ser Lys
130 135 140

Thr Leu Ala Tyr Pro Ala Gly Ile Cys Pro Thr Val Gly Val Gly Gly
145 150 155 160

His Ile Ser Gly Gly Gly Tyr Gly Asn Met Met Arg Lys Tyr Gly Leu
165 170 175

Thr Val Asp Asn Thr Ile Asp Ala Arg Met Val Asp Val Asn Gly Lys
180 185 190

Ile Leu Asp Arg Lys Leu Met Gly Glu Asp Leu Tyr Trp Ala Ile Asn
195 200 205

Gly Gly Gly Gly Gly Ser Tyr Gly Val Val Leu Ala Tyr Lys Ile Asn
210 215 220

Leu Val Glu Val Pro Glu Asn Val Thr Val Phe Arg Ile Ser Arg Thr
225 230 235 240

Leu Glu Gln Asn Ala Thr Asp Ile Ile His Arg Trp Gln Gln Val Ala
245 250 255

Pro Lys Leu Pro Asp Glu Leu Phe Ile Arg Thr Val Ile Asp Val Val
260 265 270

Asn Gly Thr Val Ser Ser Gln Lys Thr Val Arg Thr Thr Phe Ile Ala
275 280 285

Met Phe Leu Gly Asp Thr Thr Thr Leu Leu Ser Ile Leu Asn Arg Arg
290 295 300

Phe Pro Glu Leu Gly Leu Val Arg Ser Asp Cys Thr Glu Thr Ser Trp
305 310 315 320

Ile Gln Ser Val Leu Phe Trp Thr Asn Ile Gln Val Gly Ser Ser Glu
325 330 335

Thr Leu Leu Leu Gln Arg Asn Gln Pro Val Asn Tyr Leu Lys Arg Lys
340 345 350

Ser Asp Tyr Val Arg Glu Pro Ile Ser Arg Thr Gly Leu Glu Ser Ile
355 360 365

Trp Lys Lys Met Ile Glu Leu Glu Ile Pro Thr Met Ala Phe Asn Pro
370 375 380

Tyr Gly Gly Glu Met Gly Arg Ile Ser Leu Arg Val Thr Pro Phe Pro
385 390 395 400

Tyr Arg Ala Gly Asn Leu Trp Lys Ile Gln Tyr Gly Ala Asn Trp Arg
405 410 415

Asp Glu Thr Leu Thr Asp Arg Tyr Met Glu Leu Thr Arg Lys Leu Tyr
420 425 430

Gln Phe Met Thr Pro Phe Val Ser Lys Asn Pro Arg Gln Ser Phe Phe
435 440 445

Asn Asn Arg Asp Val Asp Leu Gly Ile Asn Ser His Asn Gly Lys Ile
450 455 460

Ser Ser Tyr Val Glu Gly Lys Arg Tyr Gly Lys Lys Tyr Phe Ala Gly
465 470 475 480

Asn Phe Glu Arg Leu Val Lys Ile Lys Thr Arg Val Asp Ser Gly Asn
485 490 495

Phe Phe Arg Asn Glu His Ser Ile Pro Val Leu Pro
500 505

(2) INFORMATION FOR SEQ ID NO: 72: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1527 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA to mRNA 
(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Arabidopsis thaliana 

(B) STRAIN: Columbia

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1..1524 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 72: 

ACT TCC CGT AGA AAC TCG GAG ACT TTC ACA CAA TGC CTA ACC TCA AAC 48
Thr Ser Arg Arg Asn Ser Glu Thr Phe Thr Gln Cys Leu Thr Ser Asn
1 5 10 15

TCC GAC CCC AAA CAT CCC ATC TCC CCC GCT ATC TTC TTC TCC GGA AAT 96
Ser Asp Pro Lys His Pro Ile Ser Pro Ala Ile Phe Phe Ser Gly Asn
20 25 30

GGC TCC TAC TCC TCC GTA TTA CAA GCC AAC ATC CGT AAC CTC CGC TTC 144
Gly Ser Tyr Ser Ser Val Leu Gln Ala Asn Ile Arg Asn Leu Arg Phe
35 40 45

AAC ACC ACC TCA ACT CCG AAA CCC TTC CTC ATA ATC GCC GCA ACA CAT 192
Asn Thr Thr Ser Thr Pro Lys Pro Phe Leu Ile Ile Ala Ala Thr His
50 55 60

GAA TCC CAT GTG CAA GCC GCG ATT ACT TGC GGG AAA CGC CAC AAC CTT 240
Glu Ser His Val Gln Ala Ala Ile Thr Cys Gly Lys Arg His Asn Leu
65 70 75 80

CAG ATG AAA ATC AGA AGT GGA GGC CAC GAC TAC GAT GGC TTG TCA TAC 288
Gln Met Lys Ile Arg Ser Gly Gly His Asp Tyr Asp Gly Leu Ser Tyr
85 90 95

GTT ACA TAC TCT GGC AAA CCG TTC TTC GTC CTC GAC ATG TTT AAC CTC 336 
Val Thr Tyr Ser Gly Lys Pro Phe Phe Val Leu Asp Met Phe Asn Leu 
100 105 110 

CGT TCG GTG GAT GTC GAC GTG GCA AGT AAG ACC GCG TGG GTC CAA ACC 384 
Arg Ser Val Asp Val Asp Val Ala Ser Lys Thr Ala Trp Val Gln Thr
115 120 125

GGT GCC ATA CTC GGA GAA GTT TAT TAC TAT ATA TGG GAG AAG AGC AAA 432
Gly Ala Ile Leu Gly Glu Val Tyr Tyr Tyr Ile Trp Glu Lys Ser Lys
130 135 140

ACC CTA GCT TAT CCC GCC GGA ATT TGT CCC ACG GTT GGT GTC GGT GGC 480
Thr Leu Ala Tyr Pro Ala Gly Ile Cys Pro Thr Val Gly Val Gly Gly
145 150 155 160

CAT ATC AGT GGT GGA GGT TAC GGT AAC ATG ATG AGA AAA TAC GGT CTC 528
His Ile Ser Gly Gly Gly Tyr Gly Asn Met Met Arg Lys Tyr Gly Leu
165 170 175

ACC GTA GAT AAT ACC ATC GAT GCA AGA ATG GTC GAC GTA AAT GGA AAA 576
Thr Val Asp Asn Thr Ile Asp Ala Arg Met Val Asp Val Asn Gly Lys
180 185 190

ATT TTG GAT AGA AAA TTG ATG GGA GAA GAT CTC TAC TGG GCA ATA AAC 624
Ile Leu Asp Arg Lys Leu Met Gly Glu Asp Leu Tyr Trp Ala Ile Asn
195 200 205

GGA GGA GGA GGA GGG AGC TAC GGC GTC GTA TTG GCC TAC AAA ATA AAC 672
Gly Gly Gly Gly Gly Ser Tyr Gly Val Val Leu Ala Tyr Lys Ile Asn
210 215 220

CTT GTT GAA GTC CCA GAA AAC GTC ACC GTT TTC AGA ATC TCC CGG ACG 720
Leu Val Glu Val Pro Glu Asn Val Thr Val Phe Arg Ile Ser Arg Thr
225 230 235 240

TTA GAA CAA AAT GCG ACG GAT ATC ATT CAC CGG TGG CAA CAA GTT GCA 768
Leu Glu Gln Asn Ala Thr Asp Ile Ile His Arg Trp Gln Gln Val Ala
245 250 255

CCG AAG CTT CCC GAC GAG CTT TTC ATA AGA ACA GTC ATT GAC GTA GTA 816
Pro Lys Leu Pro Asp Glu Leu Phe Ile Arg Thr Val Ile Asp Val Val
260 265 270

AAC GGC ACT GTT TCA TCT CAA AAG ACC GTC AGG ACA ACA TTC ATA GCA 864
Asn Gly Thr Val Ser Ser Gln Lys Thr Val Arg Thr Thr Phe Ile Ala
275 280 285



ATG TTT CTA GGA GAC ACG ACA ACT CTA CTG TCG ATA TTA AAC CGG AGA 912 
Met Phe Leu Gly Asp Thr Thr Thr Leu Leu Ser Ile Leu Asn Arg Arg
290 295 300

TTC CCA GAA TTG GGT TTG GTC CGG TCT GAC TGT ACC GAA ACA AGC TGG 960
Phe Pro Glu Leu Gly Leu Val Arg Ser Asp Cys Thr Glu Thr Ser Trp
305 310 315 320

ATC CAA TCT GTG CTA TTC TGG ACA AAT ATC CAA GTT GGT TCG TCG GAG 1008
Ile Gln Ser Val Leu Phe Trp Thr Asn Ile Gln Val Gly Ser Ser Glu
325 330 335

ACA CTT CTA CTC CAA AGG AAT CAA CCC GTG AAC TAC CTC AAG AGG AAA 1056
Thr Leu Leu Leu Gln Arg Asn Gln Pro Val Asn Tyr Leu Lys Arg Lys
340 345 350

TCA GAT TAC GTA CGT GAA CCG ATT TCA AGA ACC GGT TTA GAG TCA ATT 1104
Ser Asp Tyr Val Arg Glu Pro Ile Ser Arg Thr Gly Leu Glu Ser Ile
355 360 365

TGG AAG AAA ATG ATC GAG CTT GAA ATT CCG ACA ATG GCT TTC AAT CCA 1152
Trp Lys Lys Met Ile Glu Leu Glu Ile Pro Thr Met Ala Phe Asn Pro
370 375 380

TAC GGT GGT GAG ATG GGG AGG ATA TCA TCT ACG GTG ACT CCG TTC CCA 1200
Tyr Gly Gly Glu Met Gly Arg Ile Ser Ser Thr Val Thr Pro Phe Pro
385 390 395 400

TAC AGA GCC GGT AAT CTC TGG AAG ATT CAG TAC GGT GCG AAT TGG AGA 1248
Tyr Arg Ala Gly Asn Leu Trp Lys Ile Gln Tyr Gly Ala Asn Trp Arg
405 410 415

GAT GAG ACT TTA ACC GAC CGG TAC ATG GAA TTG ACG AGG AAG TTG TAC 1296
Asp Glu Thr Leu Thr Asp Arg Tyr Met Glu Leu Thr Arg Lys Leu Tyr
420 425 430

CAA TTC ATG ACA CCA TTT GTT TCC AAG AAT CCG AGA CAA TCG TTT TTC 1344
Gln Phe Met Thr Pro Phe Val Ser Lys Asn Pro Arg Gln Ser Phe Phe
435 440 445

AAT TAC CGT GAT GTT GAT TTG GGT ATT AAT TCT CAT AAT GGT AAA ATC 1392
Asn Tyr Arg Asp Val Asp Leu Gly Ile Asn Ser His Asn Gly Lys Ile
450 455 460

AGT AGT TAT GTG GAA GGT AAA CGT TAC GGG AAG AAG TAT TTC GCA GGT 1440
Ser Ser Tyr Val Glu Gly Lys Arg Tyr Gly Lys Lys Tyr Phe Ala Gly
465 470 475 480

AAT TTC GAG AGA TTG GTG AAG ATT AAG ACG AGA GTT GAT AGT GGT AAT 1488
Asn Phe Glu Arg Leu Val Lys Ile Lys Thr Arg Val Asp Ser Gly Asn
485 490 495

TTC TTT AGG AAC GAA CAG AGT ATT CCT GTG TTA CCA TAA 1527
Phe Phe Arg Asn Glu Gln Ser Ile Pro Val Leu Pro
500 505

(2) INFORMATION FOR SEQ ID NO: 73: 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 508 amino acids 



(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 73: 

Thr Ser Arg Arg Asn Ser Glu Thr Phe Thr Gln Cys Leu Thr Ser Asn
1 5 10 15

Ser Asp Pro Lys His Pro Ile Ser Pro Ala Ile Phe Phe Ser Gly Asn
20 25 30

Gly Ser Tyr Ser Ser Val Leu Gln Ala Asn Ile Arg Asn Leu Arg Phe
35 40 45

Asn Thr Thr Ser Thr Pro Lys Pro Phe Leu Ile Ile Ala Ala Thr His
50 55 60

Glu Ser His Val Gln Ala Ala Ile Thr Cys Gly Lys Arg His Asn Leu
65 70 75 80

Gln Met Lys Ile Arg Ser Gly Gly His Asp Tyr Asp Gly Leu Ser Tyr
85 90 95

Val Thr Tyr Ser Gly Lys Pro Phe Phe Val Leu Asp Met Phe Asn Leu
100 105 110

Arg Ser Val Asp Val Asp Val Ala Ser Lys Thr Ala Trp Val Gln Thr
115 120 125

Gly Ala Ile Leu Gly Glu Val Tyr Tyr Tyr Ile Trp Glu Lys Ser Lys
130 135 140

Thr Leu Ala Tyr Pro Ala Gly Ile Cys Pro Thr Val Gly Val Gly Gly
145 150 155 160

His Ile Ser Gly Gly Gly Tyr Gly Asn Met Met Arg Lys Tyr Gly Leu
165 170 175

Thr Val Asp Asn Thr Ile Asp Ala Arg Met Val Asp Val Asn Gly Lys
180 185 190

Ile Leu Asp Arg Lys Leu Met Gly Glu Asp Leu Tyr Trp Ala Ile Asn
195 200 205

Gly Gly Gly Gly Gly Ser Tyr Gly Val Val Leu Ala Tyr Lys Ile Asn
210 215 220

Leu Val Glu Val Pro Glu Asn Val Thr Val Phe Arg Ile Ser Arg Thr
225 230 235 240

Leu Glu Gln Asn Ala Thr Asp Ile Ile His Arg Trp Gln Gln Val Ala
245 250 255

Pro Lys Leu Pro Asp Glu Leu Phe Ile Arg Thr Val Ile Asp Val Val
260 265 270

Asn Gly Thr Val Ser Ser Gln Lys Thr Val Arg Thr Thr Phe Ile Ala
275 280 285

Met Phe Leu Gly Asp Thr Thr Thr Leu Leu Ser Ile Leu Asn Arg Arg
290 295 300

Phe Pro Glu Leu Gly Leu Val Arg Ser Asp Cys Thr Glu Thr Ser Trp 
305 310 315 320 



Ile Gln Ser Val Leu Phe Trp Thr Asn Ile Gln Val Gly Ser Ser Glu
325 330 335

Thr Leu Leu Leu Gln Arg Asn Gln Pro Val Asn Tyr Leu Lys Arg Lys
340 345 350

Ser Asp Tyr Val Arg Glu Pro Ile Ser Arg Thr Gly Leu Glu Ser Ile
355 360 365

Trp Lys Lys Met Ile Glu Leu Glu Ile Pro Thr Met Ala Phe Asn Pro
370 375 380

Tyr Gly Gly Glu Met Gly Arg Ile Ser Ser Thr Val Thr Pro Phe Pro
385 390 395 400

Tyr Arg Ala Gly Asn Leu Trp Lys Ile Gln Tyr Gly Ala Asn Trp Arg
405 410 415

Asp Glu Thr Leu Thr Asp Arg Tyr Met Glu Leu Thr Arg Lys Leu Tyr
420 425 430

Gln Phe Met Thr Pro Phe Val Ser Lys Asn Pro Arg Gln Ser Phe Phe
435 440 445

Asn Tyr Arg Asp Val Asp Leu Gly Ile Asn Ser His Asn Gly Lys Ile
450 455 460

Ser Ser Tyr Val Glu Gly Lys Arg Tyr Gly Lys Lys Tyr Phe Ala Gly
465 470 475 480

Asn Phe Glu Arg Leu Val Lys Ile Lys Thr Arg Val Asp Ser Gly Asn
485 490 495

Phe Phe Arg Asn Glu Gln Ser Ile Pro Val Leu Pro
500 505



(2) INFORMATION FOR SEQ ID NO : 74: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1530 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: cDNA to mRNA 
(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Arabidopsis thaliana 

(B) STRAIN: Columbia



(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1..1527 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 74:

TCC ATC CAA GAT CAA TTC ATA AAC TGT GTC AAA AGA AAC ACA CAT GTT 48
Ser Ile Gln Asp Gln Phe Ile Asn Cys Val Lys Arg Asn Thr His Val
1 5 10 15

TCT TTT CCA CTC GAG AAA ACG TTA TTC ACC CCT GCG AAA AAC GTC TCT 96
Ser Phe Pro Leu Glu Lys Thr Leu Phe Thr Pro Ala Lys Asn Val Ser
20 25 30

TTG TTC AAC CAA GTC CTT GAA TCG ACG GCT CAA AAT CTC CAG TTC TTG 144
Leu Phe Asn Gln Val Leu Glu Ser Thr Ala Gln Asn Leu Gln Phe Leu
35 40 45

GCA AAA TCC ATG CCT AAA CCG GGA TTC ATA TTC AGA CCG ATT CAC CAG 192
Ala Lys Ser Met Pro Lys Pro Gly Phe Ile Phe Arg Pro Ile His Gln
50 55 60

TCT CAA GTC CAA GCT TCC ATC ATT TGT TCA AAG AAA CTC GGA ATT CAT 240
Ser Gln Val Gln Ala Ser Ile Ile Cys Ser Lys Lys Leu Gly Ile His
65 70 75 80

TTT CGT GTT AGA AGT GGC GGT CAC GAT TTC GAG GCC TTG TCT TAT GTT 288
Phe Arg Val Arg Ser Gly Gly His Asp Phe Glu Ala Leu Ser Tyr Val
85 90 95

TCA CGG ATT GAA AAA CCG TTT ATA TTA CTC GAC CTG TCA AAA TTG AAA 336
Ser Arg Ile Glu Lys Pro Phe Ile Leu Leu Asp Leu Ser Lys Leu Lys
100 105 110

CAA ATC AAT GTT GAT ATT GAA TCC AAT AGT GCT TGG GTT CAA CCT GGT 384
Gln Ile Asn Val Asp Ile Glu Ser Asn Ser Ala Trp Val Gln Pro Gly
115 120 125

GCT ACG CTT GGT GAG CTT TAC TAC AGA ATT GCA GAG AAG AGC AAG ATC 432
Ala Thr Leu Gly Glu Leu Tyr Tyr Arg Ile Ala Glu Lys Ser Lys Ile
130 135 140

CAT GGA TTT CCC GCG GGT TTG TGC ACA AGT GTA GGC ATA GGT GGG TAT 480
His Gly Phe Pro Ala Gly Leu Cys Thr Ser Val Gly Ile Gly Gly Tyr
145 150 155 160

ATG ACA GGC GGT GGA TAC GGT ACC TTG ATG AGG AAG TAT GGT CTT GCG 528
Met Thr Gly Gly Gly Tyr Gly Thr Leu Met Arg Lys Tyr Gly Leu Ala
165 170 175



GGA GAT AAT GTT CTA GAC GTA AAG ATG GTT GAT GCA AAT GGT AAA TTA 576 
Gly Asp Asn Val Leu Asp Val Lys Met Val Asp Ala Asn Gly Lys Leu
180 185 190

CTC GAC AGA GCC GCG ATG GGT GAG GAC CTA TTT TGG GCG ATT AGA GGA 624
Leu Asp Arg Ala Ala Met Gly Glu Asp Leu Phe Trp Ala Ile Arg Gly
195 200 205

GGC GGT GGA GCG AGT TTC GGG ATA GTT CTA GCA TGG AAG ATC AAG CTT 672
Gly Gly Gly Ala Ser Phe Gly Ile Val Leu Ala Trp Lys Ile Lys Leu
210 215 220

GTT CCT GTT CCT AAG ACT GTT ACC GTC TTC ACT GTC ACC AAA ACG TTA 720
Val Pro Val Pro Lys Thr Val Thr Val Phe Thr Val Thr Lys Thr Leu
225 230 235 240

GAA CAA GAC GCA AGA TTG AAG ACT ATT TCT AAG TGG CAA CAA ATT TCA 768
Glu Gln Asp Ala Arg Leu Lys Thr Ile Ser Lys Trp Gln Gln Ile Ser
245 250 255

TCC AAG ATT ATT GAA GAG ATA CAC ATC CGA GTG GTA CTC AGA GCA GCT 816
Ser Lys Ile Ile Glu Glu Ile His Ile Arg Val Val Leu Arg Ala Ala
260 265 270

GGA AAT GAT GGA AAC AAG ACT GTG ACA ATG ACC TAC CTA GGT CAG TTT 864
Gly Asn Asp Gly Asn Lys Thr Val Thr Met Thr Tyr Leu Gly Gln Phe
275 280 285

CTT GGC GAG AAA GGC ACC TTG CTG AAG GTT ATG GAG AAG GCT TTT CCA 912
Leu Gly Glu Lys Gly Thr Leu Leu Lys Val Met Glu Lys Ala Phe Pro
290 295 300

GAA CTA GGG TTA ACT CAA AAG GAT TGT ACT GAA ATG AGC TGG ATT GAA 960
Glu Leu Gly Leu Thr Gln Lys Asp Cys Thr Glu Met Ser Trp Ile Glu
305 310 315 320

GCC GCC CTT TTC CAT GGT GGA TTT CCA ACA GGT TCT CCT ATT GAA ATT 1008
Ala Ala Leu Phe His Gly Gly Phe Pro Thr Gly Ser Pro Ile Glu Ile
325 330 335

TTG CTT CAG CTC AAG TCG CCT CTA GGA AAA GAT TAC TTC AAA GCA ACG 1056
Leu Leu Gln Leu Lys Ser Pro Leu Gly Lys Asp Tyr Phe Lys Ala Thr
340 345 350

TCG GAT TTC GTT AAA GAA CCT ATT CCT GTG ATA GGC TTC AAA GGA ATA 1104
Ser Asp Phe Val Lys Glu Pro Ile Pro Val Ile Gly Phe Lys Gly Ile
355 360 365

TTC AAA AGA TTG ATT GAA GGA AAC ACA ACA TTT CTG AAC TGG ACT CCT 1152
Phe Lys Arg Leu Ile Glu Gly Asn Thr Thr Phe Leu Asn Trp Thr Pro
370 375 380

TAC GGT GGT ATG ATG TCG AAA ATC CCT GAA TCT GCG ATC CCA TTT CCG 1200
Tyr Gly Gly Met Met Ser Lys Ile Pro Glu Ser Ala Ile Pro Phe Pro
385 390 395 400

CAT AGA AAC GGA ACC CTC TTC AAG ATT CTC TAT TAC GCG AAC TGG CTA 1248
His Arg Asn Gly Thr Leu Phe Lys Ile Leu Tyr Tyr Ala Asn Trp Leu
405 410 415

GAG AAT GAC AAG ACA TCG AGT AGA AAA ATC AAC TGG ATC AAA GAG ATA 1296
Glu Asn Asp Lys Thr Ser Ser Arg Lys Ile Asn Trp Ile Lys Glu Ile
420 425 430

TAC AAT TAC ATG GCG CCT TAT GTC TCA AGC AAT CCA AGA CAA GCA TAT 1344
Tyr Asn Tyr Met Ala Pro Tyr Val Ser Ser Asn Pro Arg Gln Ala Tyr
435 440 445

GTG AAC TAC AGA GAT CTA GAC TTC GGA CAG AAC AAG AAC AAC GCA AAG 1392
Val Asn Tyr Arg Asp Leu Asp Phe Gly Gln Asn Lys Asn Asn Ala Lys
450 455 460

GTT AAC TTC ATT GAA GCT AAA ATC TGG GGA CCT AAG TAC TTC AAA GGC 1440
Val Asn Phe Ile Glu Ala Lys Ile Trp Gly Pro Lys Tyr Phe Lys Gly
465 470 475 480

AAT TTT GAC AGA TTG GTG AAG ATT AAA ACC AAG GTT GAT CCA GAG AAC 1488
Asn Phe Asp Arg Leu Val Lys Ile Lys Thr Lys Val Asp Pro Glu Asn
485 490 495

TTC TTC AGG CAC GAG CAG AGT ATC CCA CCT ATG CCC TAC TAG 1530
Phe Phe Arg His Glu Gln Ser Ile Pro Pro Met Pro Tyr
500 505



(2) INFORMATION FOR SEQ ID NO: 75: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 509 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 75: 

Ser Ile Gln Asp Gln Phe Ile Asn Cys Val Lys Arg Asn Thr His Val
1 5 10 15

Ser Phe Pro Leu Glu Lys Thr Leu Phe Thr Pro Ala Lys Asn Val Ser
20 25 30

Leu Phe Asn Gln Val Leu Glu Ser Thr Ala Gln Asn Leu Gln Phe Leu
35 40 45

Ala Lys Ser Met Pro Lys Pro Gly Phe Ile Phe Arg Pro Ile His Gln
50 55 60

Ser Gln Val Gln Ala Ser Ile Ile Cys Ser Lys Lys Leu Gly Ile His
65 70 75 80

Phe Arg Val Arg Ser Gly Gly His Asp Phe Glu Ala Leu Ser Tyr Val
85 90 95

Ser Arg Ile Glu Lys Pro Phe Ile Leu Leu Asp Leu Ser Lys Leu Lys
100 105 110

Gln Ile Asn Val Asp Ile Glu Ser Asn Ser Ala Trp Val Gln Pro Gly
115 120 125

Ala Thr Leu Gly Glu Leu Tyr Tyr Arg Ile Ala Glu Lys Ser Lys Ile
130 135 140

His Gly Phe Pro Ala Gly Leu Cys Thr Ser Val Gly Ile Gly Gly Tyr
145 150 155 160

Met Thr Gly Gly Gly Tyr Gly Thr Leu Met Arg Lys Tyr Gly Leu Ala
165 170 175

Gly Asp Asn Val Leu Asp Val Lys Met Val Asp Ala Asn Gly Lys Leu
180 185 190

Leu Asp Arg Ala Ala Met Gly Glu Asp Leu Phe Trp Ala Ile Arg Gly
195 200 205

Gly Gly Gly Ala Ser Phe Gly Ile Val Leu Ala Trp Lys Ile Lys Leu
210 215 220

Val Pro Val Pro Lys Thr Val Thr Val Phe Thr Val Thr Lys Thr Leu
225 230 235 240

Glu Gln Asp Ala Arg Leu Lys Thr Ile Ser Lys Trp Gln Gln Ile Ser
245 250 255

Ser Lys Ile Ile Glu Glu Ile His Ile Arg Val Val Leu Arg Ala Ala
260 265 270

Gly Asn Asp Gly Asn Lys Thr Val Thr Met Thr Tyr Leu Gly Gln Phe
275 280 285

Leu Gly Glu Lys Gly Thr Leu Leu Lys Val Met Glu Lys Ala Phe Pro
290 295 300

Glu Leu Gly Leu Thr Gln Lys Asp Cys Thr Glu Met Ser Trp Ile Glu
305 310 315 320

Ala Ala Leu Phe His Gly Gly Phe Pro Thr Gly Ser Pro Ile Glu Ile
325 330 335



Leu Leu Gln Leu Lys Ser Pro Leu Gly Lys Asp Tyr Phe Lys Ala Thr
340 345 350

Ser Asp Phe Val Lys Glu Pro Ile Pro Val Ile Gly Phe Lys Gly Ile
355 360 365

Phe Lys Arg Leu Ile Glu Gly Asn Thr Thr Phe Leu Asn Trp Thr Pro
370 375 380

Tyr Gly Gly Met Met Ser Lys Ile Pro Glu Ser Ala Ile Pro Phe Pro
385 390 395 400

His Arg Asn Gly Thr Leu Phe Lys Ile Leu Tyr Tyr Ala Asn Trp Leu
405 410 415

Glu Asn Asp Lys Thr Ser Ser Arg Lys Ile Asn Trp Ile Lys Glu Ile
420 425 430

Tyr Asn Tyr Met Ala Pro Tyr Val Ser Ser Asn Pro Arg Gln Ala Tyr
435 440 445

Val Asn Tyr Arg Asp Leu Asp Phe Gly Gln Asn Lys Asn Asn Ala Lys
450 455 460

Val Asn Phe Ile Glu Ala Lys Ile Trp Gly Pro Lys Tyr Phe Lys Gly
465 470 475 480

Asn Phe Asp Arg Leu Val Lys Ile Lys Thr Lys Val Asp Pro Glu Asn
485 490 495

Phe Phe Arg His Glu Gln Ser Ile Pro Pro Met Pro Tyr
500 505



